CDR-GRAFTED ANTI-TISSUE FACTOR
ANTIBODIES AND METHODS OF USE THEREOF

FIELD OF THE INVENTION 

Monoclonal antibodies capable of inhibiting
tissue factor (TF) are useful as anticoagulants.
Conventional rodent monoclonal antibodies, however, have
limited use in human therapeutic and diagnostic
applications due to immunogenicity and short serum
half-life. The present invention provides CDR-grafted
monoclonal antibodies against TF that retain the high
binding affinity of rodent antibodies but have reduced
immunogenicity. The present humanized antibodies are
potent anticoagulants and are thus useful in the
treatment and prophylaxis of human thrombotic disease.
The invention also provides methods of making the
CDR-grafted antibodies and pharmaceutical compositions for
the attenuation or prevention of coagulation.

BACKGROUND OF THE INVENTION

The coagulation of blood involves a cascading
series of reactions leading to the formation of fibrin.
The coagulation cascade consists of two overlapping
pathways, both of which are required for hemostasis.
The intrinsic pathway comprises protein factors present
in circulating blood, while the extrinsic pathway
requires tissue factor, which is expressed on the cell
surface of a variety of tissues in response to vascular
injury. Davie et al., 1991, Biochemistry 30:10363.
Agents that interfere with the coagulation cascade, such
as heparin and coumarin derivatives, have well-known
therapeutic uses in the prophylaxis of venous
thrombosis. Goodman and Gilman, eds., 1980, The
Pharmacological Basis of Therapeutics, MacMillan
Publishing Co., Inc., New York.

Tissue factor (TF) has been investigated as a
target for anticoagulant therapy. TF is a membrane
glycoprotein that functions as a receptor for factor VII
and VIIa and thereby initiates the extrinsic pathway of
the coagulation cascade in response to vascular injury.
In addition to its role in the maintenance of hemostasis
by initiation of blood clotting, TF has been implicated
in pathogenic conditions. Specifically, the synthesis
and cell surface expression of TF have been implicated in
vascular disease (Wilcox et al., 1989, Proc. Natl. Acad.
Sci. 86:2839) and gram-negative septic shock (Warr et
al., 1990, Blood 75:1481).

Ruf et al. (1991, Thrombosis and Haemostasis
66:529) characterized the anticoagulant potential of
murine monoclonal antibodies against human TF. The
inhibition of TF function by most of the monoclonal
antibodies that were assessed was dependent upon the
dissociation of the TF/VIIa complex that is rapidly
formed when TF contacts plasma. Such antibodies were
thus relatively slow inhibitors of TF in plasma. One
monoclonal antibody, TF8-5G9, was capable of inhibiting
the TF/VIIa complex without dissociation of the complex,
thus providing an immediate anticoagulant effect in
plasma. Ruf et al. suggest that mechanisms that
inactivate the TF/VIIa complex, rather than prevent its
formation, may provide strategies for interruption of
coagulation in vivo.

The therapeutic use of monoclonal antibodies
against TF is limited in that currently available
monoclonals are of rodent origin. The use of rodent
antibodies in human therapy presents numerous problems,
the most significant of which is immunogenicity.
Repeated doses of rodent monoclonal antibodies have been
found to elicit an anti-immunoglobulin response termed
human anti-mouse antibody (HAMA), which can result in
immune complex disease and/or neutralization of the
therapeutic antibody. See, e.g., Jaffers et al. (1986)
Transplantation 41:572. While the use of human
monoclonal antibodies would address this limitation, it
has proven difficult to generate large amounts of human
monoclonal antibodies by conventional hybridoma
technology.

Recombinant technology has been used in an
effort to construct "humanized" antibodies that maintain
the high binding affinity of rodent monoclonal
antibodies but exhibit reduced immunogenicity in humans.
Chimeric antibodies have been produced in which the
variable (V) region of a mouse antibody is combined with
the constant (C) region of a human antibody in an effort
to maintain the specificity and affinity of the rodent
antibody but reduce the amount of protein that is
non-human and thus immunogenic. While the immune response
to chimeric antibodies is generally reduced relative to
the corresponding rodent antibody, the immune response
cannot be completely eliminated, because the mouse V
region is capable of eliciting an immune response.
Lobuglio et al. (1989) Proc. Natl. Acad. Sci. 86:4220;
Jaffers et al. (1986) Transplantation 41:572.

In a recent approach to reducing
immunogenicity of rodent antibodies, only the rodent
complementarity determining regions (CDRs), rather than
the entire V domain, are transplanted to a human
antibody. Such humanized antibodies are known as
CDR-grafted antibodies. CDRs are regions of
hypervariability in the V regions that are flanked by
relatively conserved regions known as framework (FR)
regions. Each V domain contains three CDRs flanked by
four FRs. The CDRs fold to form the antigen binding
site of the antibody, while the FRs support the
structural conformations of the V domains. Thus by
transplanting the rodent CDRs to a human antibody, the
antigen binding domain can theoretically also be
transferred. Owens et al. (1994) J. Immunol. Methods
168:149 and Winter et al. (1993) Immunology Today 14:243
review the development of CDR-grafted antibodies.

Orlandi et al. (1989) Proc. Natl. Acad. Sci.
USA 86:3833 constructed a humanized antibody against the
relatively simple hapten nitrophenacetyl (NP). The
CDR-grafted antibody contained mouse CDRs and human FRs, and
exhibited NP binding activity similar to the native
mouse antibody. However, the construction of
CDR-grafted antibodies recognizing more complex antigens has
resulted in antibodies having binding activity
significantly lower than the native rodent antibodies.
In numerous cases it has been demonstrated that the mere
introduction of rodent CDRs into a human antibody
background is insufficient to maintain full binding
activity, perhaps due to distortion of the CDR
conformation by the human FR.

For example, Gorman et al. (1991) Proc. Natl.
Acad. Sci. 88:4181 compared two humanized antibodies
against human CD4 and observed considerably different
avidities depending upon the particular human framework
region of the humanized antibody. Co et al. (1991)
Proc. Natl. Acad. Sci. USA 88:2869 required a refined
computer model of the murine antibody of interest in
order to identify critical amino acids to be considered
in the design of a humanized antibody. Kettleborough et
al. (1991) Protein Engineering 4:773 report the
influence of particular FR residues of a CDR-grafted
antibody on antigen binding, and propose that the
residues may directly interact with antigen, or may
alter the conformation of the CDR loops. Similarly,
Singer et al. (1993) J. Immunol. 150:2844 report that
optimal humanization of an anti-CD18 murine monoclonal
antibody is dependent upon the ability of the selected
FR to support the CDR in the appropriate antigen binding
conformation. Accordingly, recreation of the
antigen-binding site requires consideration of the potential
intrachain interactions between the FR and CDR, and
manipulation of amino acid residues of the FR that
maintain contacts with the loops formed by the CDRs.
While general theoretical guidelines have been proposed
for the design of humanized antibodies (see, e.g., Owens
et al.), in all cases the procedure must be tailored and
optimized for the particular rodent antibody of
interest.

There is a need in the art for humanized
antibodies with reduced immunogenicity and comparable
binding affinity relative to the parent rodent antibody
for various therapeutic applications. In particular,
there is a need for a humanized antibody against human
tissue factor having anticoagulant activity and useful
in the treatment and prevention of thrombotic disease.

SUMMARY OF THE INVENTION 

The present invention is directed to
CDR-grafted antibodies capable of inhibiting human tissue
factor wherein the CDRs are derived from a non-human
monoclonal antibody against tissue factor and the FR and
constant (C) regions are derived from one or more human
antibodies. In a preferred embodiment, the murine
monoclonal antibody is TF8-5G9.

In another embodiment, the present invention
provides a method of producing a CDR-grafted antibody
capable of inhibiting human tissue factor, which method
comprises constructing one or more expression vectors
containing nucleic acids encoding CDR-grafted antibody
heavy and light chains, transfecting suitable host cells
with the expression vector or vectors, culturing the
transfected host cells, and recovering the CDR-grafted
antibody.

The present invention also provides a method
of attenuation of coagulation comprising administering a
CDR-grafted antibody capable of inhibiting human tissue
factor to a patient in need of such attenuation.

The present invention further provides a
method of treatment or prevention of thrombotic disease
comprising administering a CDR-grafted antibody capable
of inhibiting human tissue factor to a patient in need
of such treatment or prevention. In a preferred
embodiment, the thrombotic disease is intravascular
coagulation, arterial restenosis or arteriosclerosis.

Another embodiment of the present invention is
directed to a pharmaceutical composition comprising
CDR-grafted antibodies capable of inhibiting human tissue
factor and further comprising a pharmaceutically
acceptable carrier.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 provides the nucleotide and deduced
amino acid sequences of the heavy chain of murine
monoclonal antibody TF8-5G9.

Fig. 2 provides the nucleotide and deduced
amino acid sequences of the light chain of murine
monoclonal antibody TF8-5G9.

Fig. 3 is a graph depicting the ability of
CDR-grafted antibody TF8HCDR1 x TF8LCDR1 to bind to
human tissue factor and to compete with murine
monoclonal antibody TF85G9 for binding to tissue factor.
Solid symbols indicate direct binding of TF8HCDR1 x
TF8LCDR1 and the positive control chimeric TF85G9 to
tissue factor. Open symbols indicate competition
binding of TF8HCDR1 x TF8LCDR1 or chimeric TF85G9 with
murine monoclonal antibody TF85G9.

Fig. 4 presents the DNA sequence of expression
vector pEe6TF8HCDR20 and the amino acid sequence of the
coding regions of the CDR-grafted heavy chain TF8HCDR20.

Fig. 5 presents the DNA sequence of expression
vector pEe12TF8LCDR3 and the amino acid sequence of the
coding regions of the CDR-grafted light chain TF8LCDR3.

Fig. 6 is a graph depicting the ability of
CDR-grafted antibody TF8HCDR20 x TF8LCDR3 to bind to
human tissue factor.

Fig. 7 is a graph depicting the ability of
CDR-grafted antibody TF8HCDR20 x TF8LCDR3 to compete
with murine monoclonal antibody TF85G9 for binding to
tissue factor.

Fig. 8 is a graph depicting the ability of 
CDR-grafted antibody TF8HCDR20 x TF8LCDR3 to inhibit 
factor X activation. 

Fig. 9 provides expression vector
pEe6TF8HCDR20 resulting from the subcloning of
CDR-grafted heavy chain TF8HCDR20 into myeloma expression
vector pEehCMV-BglII. The following abbreviations are
used: VH is the CDR-grafted heavy chain variable
region; Cγ4 is the human IgG4 constant region; pA is the
polyadenylation signal; ampR is the β-lactamase gene;
and hCMV is human cytomegalovirus.

Fig. 10 provides expression vector
pEe12TF8LCDR3 resulting from the subcloning of
CDR-grafted light chain TF8LCDR3 into myeloma expression
vector pEe12. The following abbreviations are used: VL
is the CDR-grafted light chain variable region; Cκ is
the human kappa constant region; SVE is the SV40 early
promoter; GS is glutamine synthetase cDNA. Other
abbreviations are as noted in Fig. 9.



DETAILED DESCRIPTION OF THE INVENTION 



The present invention provides CDR-grafted
antibodies capable of inhibiting human tissue factor
wherein the CDRs are derived from a non-human monoclonal
antibody against tissue factor and the FR and C regions
are derived from one or more human antibodies. The
present invention further provides methods of making and
using the subject CDR-grafted antibodies.

In accordance with the present invention, the
CDR-grafted antibody is an antibody in which the CDRs
are derived from a non-human antibody capable of binding
to and inhibiting the function of human tissue factor,
and the FR and C regions of the antibody are derived
from one or more human antibodies. The CDRs derived
from the non-human antibody preferably have from about
90% to about 100% identity with the CDRs of the
non-human antibody, although any and all modifications,
including substitutions, insertions and deletions, are
contemplated so long as the CDR-grafted antibody
maintains the ability to bind to and inhibit tissue
factor. The regions of the CDR-grafted antibodies that
are derived from human antibodies need not have 100%
identity with the human antibodies. In a preferred
embodiment, as many of the human amino acid residues as
possible are retained in order that immunogenicity is
negligible, but the human residues, in particular
residues of the FR region, are substituted as required
and as taught hereinbelow in accordance with the present
invention. Such modifications as disclosed herein are
necessary to support the antigen binding site formed by
the CDRs while simultaneously maximizing the
humanization of the antibody.

Non-human monoclonal antibodies against human
tissue factor from which the CDRs can be derived are
known in the art (Ruf et al., 1991; Morrisey et al.,
1988, Thrombosis Research 52:247) or can be produced by
well-known methods of monoclonal antibody production
(see, e.g., Harlow et al., eds., 1988, Antibodies, A
Laboratory Manual, Cold Spring Harbor Laboratories, Cold
Spring Harbor, New York). Purified human tissue factor
against which monoclonal antibodies can be raised is
similarly well-known (Morrisey et al., 1987, Cell
50:129) and available to the skilled artisan. Murine
monoclonal antibodies, and in particular murine
monoclonal antibody TF8-5G9 disclosed by Ruf et al. and
Morrisey et al., 1988, Thrombosis Research 52:247, and
U.S. Patent No. 5,223,427 are particularly preferred.

The ordinarily skilled artisan can determine
the sequences of the CDRs by reference to published
scientific literature or sequence databanks, or by
cloning and sequencing the heavy and light chains of the
antibodies by conventional methodology. In accordance
with the present invention, the cDNA and amino acid
sequences of the heavy chain (SEQ ID NOS:1 and 2,
respectively) and light chain (SEQ ID NOS:3 and 4,
respectively) of murine monoclonal antibody TF8-5G9 are
provided. The cDNA and deduced amino acid sequences of
the murine TF8-5G9 heavy chain are provided at Figure 1.
The cDNA and deduced amino acid sequences of the murine
TF8-5G9 light chain are provided at Figure 2.

Each of the heavy and light chain variable
regions contains three CDRs that combine to form the
antigen binding site. The three CDRs are surrounded by
four FR regions that primarily function to support the
CDRs. The sequences of the CDRs within the sequences of
the variable regions of the heavy and light chains can
be identified by computer-assisted alignment according
to Kabat et al. (1987) in Sequences of Proteins of
Immunological Interest, 4th ed., United States
Department of Health and Human Services, US Government
Printing Office, Washington, D.C., or by molecular
modeling of the variable regions, for example utilizing
the ENCAD program as described by Levitt (1983) J. Mol.
Biol. 168:595.

In a preferred embodiment the CDRs are derived 
from murine monoclonal antibody TF8-5G9. The preferred 
heavy chain CDRs have the following sequences: 

CDR1 DDYMH (SEQ ID NO: 5)

CDR2 LIDPENGNTIYDPKFQG (SEQ ID NO: 6)

CDR3 DNSYYFDY (SEQ ID NO: 7)

The preferred light chain CDRs have the following sequences:

CDR1 KASQDIRKYLN (SEQ ID NO: 8)

CDR2 YATSLAD (SEQ ID NO: 9)

CDR3 LQHGESPYT (SEQ ID NO: 10)

The sequences of the CDRs of the murine or other
non-human antibody, and in particular the sequences of the
CDRs of TF8-5G9, may be modified by insertions,
substitutions and deletions to the extent that the
CDR-grafted antibody maintains the ability to bind to and
inhibit human tissue factor. The ordinarily skilled
artisan can ascertain the maintenance of this activity
by performing the functional assays described
hereinbelow. The CDRs can have, for example, from about
50% to about 100% homology to the CDRs of SEQ ID NOS:5-10.
In a preferred embodiment the CDRs have from about
80% to about 100% homology to the CDRs of SEQ ID NOS:5-10.
In a more preferred embodiment the CDRs have from
about 90% to about 100% homology to the CDRs of SEQ ID
NOS:5-10. In a most preferred embodiment the CDRs have
about 100% homology to the CDRs of SEQ ID NOS:5-10.
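
As an illustration of how the homology ranges above might be checked for a candidate CDR, the following minimal sketch computes percent identity against the corresponding TF8-5G9 CDR. It assumes equal-length, ungapped sequences, so it covers substitutions only and not insertions or deletions. The function name percent_identity and the variant sequence are hypothetical and are not part of the disclosure.

```python
# CDR sequences as listed above (SEQ ID NOS:5-10).
TF8_5G9_CDRS = {
    "H1": "DDYMH",
    "H2": "LIDPENGNTIYDPKFQG",
    "H3": "DNSYYFDY",
    "L1": "KASQDIRKYLN",
    "L2": "YATSLAD",
    "L3": "LQHGESPYT",
}


def percent_identity(candidate: str, reference: str) -> float:
    """Percent of positions at which two equal-length sequences match."""
    if len(candidate) != len(reference):
        raise ValueError("this sketch handles equal-length sequences only")
    matches = sum(a == b for a, b in zip(candidate, reference))
    return 100.0 * matches / len(reference)


# Hypothetical light chain CDR3 variant with one substitution (E to D).
variant_l3 = "LQHGDSPYT"
identity = percent_identity(variant_l3, TF8_5G9_CDRS["L3"])
print(f"{identity:.1f}% identity to SEQ ID NO: 10")
```

Whether a variant in a given identity range is actually within the invention still depends on the functional assays, since identity alone does not establish binding or inhibition.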

The FR and C regions of the CDR-grafted
antibodies of the present invention are derived from one
or more human antibodies. Human antibodies of the same
class and type as the antibody from which the CDRs are
derived are preferred. The FR of the variable region of
the heavy chain is preferably derived from the human
antibody KOL (Schmidt et al., 1983, Hoppe-Seyler's Z.
Physiol. Chem. 364:713). The FR of the variable region
of the light chain is preferably derived from the human
antibody REI (Epp et al., 1974, Eur. J. Biochem.
45:513). In accordance with the present invention, it
has been discovered that certain residues of the human
FR are preferably replaced by the corresponding residue
of the non-human antibody from which the CDRs are
derived. For example, certain FR residues of TF8-5G9
are preferably retained to achieve optimal binding to
antigen.

For convenience, the numbering scheme of Kabat
et al. has been adopted herein. Residues are designated
by lower case numbers or hyphens as necessary to conform
the present sequences to the standard Kabat numbered
sequence.

In accordance with the present invention,
residues that are retained in the FR region, i.e.,
residues that are not replaced by human FR residues, are
determined according to the following guidelines.

Residues that are idiosyncratic to the parent antibody,

WO 96/40921 






PCT/US96/09287 



e.g. TF8-5G9, relative to a human consensus sequence of
Kabat et al., are retained. Residues of the parent
antibody that are in agreement with the consensus
sequence are retained if the corresponding residue of
the human antibody, e.g. KOL or REI, is idiosyncratic.
Residues that are part of the antibody loop canonical
structures defined by Chothia et al. (1989) Nature
342:877, such as residue 71 of the heavy and light
chains, are retained. FR residues predicted to form
loops, such as residues 28-30 of the heavy chain, are
retained. FR residues predicted to influence the
conformation of the CDRs, such as residues 48 and 49
preceding CDR2 of the heavy chain, are retained.
Residues that have been demonstrated to be critical in
the humanization of other antibodies may also be
retained. The foregoing guidelines are followed to the
extent necessary to support the antigen binding site
formed by the CDRs while simultaneously maximizing the
humanization of the antibody.

The amino acid sequence of a representative
CDR-grafted heavy chain variable region derived from
murine monoclonal antibody TF8-5G9 and human antibody
KOL is shown below. The CDR-grafted heavy chain is
designated TF8HCDR1; murine residues were retained in
the FR at residues 6, 17, 23, 24, 28, 29, 30, 48, 49,
68, 71, 73, 78, 88 and 91. CDRs are underlined.

10 20 30 35ab 50 

QVQLVQSGGG WQPGRLLRL SCKASGFNIK DYYMH — WVR QAPGKGLEWIG 
52abc 60 70 80 82abc 90 

LIDP — ENGNTIYD PKFQGR FSIS ADTSK — NTAFL QMDSLRPEDTAVY 
100 110 

YCA RDNSYYF DYWGQGTPVT VSS (SEQ ID NO: 11)

The amino acid sequence of a representative
CDR-grafted light chain variable region derived from
murine monoclonal antibody TF8-5G9 and human antibody
REI is shown below. The CDR-grafted light chain is
designated TF8LCDR1; murine residues were retained in
the FR at residues 39, 41, 46 and 105. CDRs are
underlined.



10 20 30 40 50 

DIQMTQSPSS LSASVGDRVT ITC KASQDIR KYLNW YQQK WKAPKTLIYY 

60 70 80 90 100 

ATSLADGVPS RFSGSGSGTD YTFTISSLQP EDIATYYC LQ HGESPYT FGQ 



GTKLEITR (SEQ ID NO: 12) 



A CDR-grafted antibody containing variable
regions TF8HCDR1 and TF8LCDR1 has been demonstrated in
accordance with the present invention to be as effective
as murine monoclonal antibody TF8-5G9 in binding to
human tissue factor. It has been further discovered in
accordance with the present invention, by examination of
the molecular structure of murine monoclonal antibody
TF8-5G9, and by design, construction, and analysis of
CDR-grafted antibodies, that the FR regions can be
further humanized without the loss of antigen binding
activity. In particular, the FR region may retain the
human FR residue at residues 6, 17, 68, 73 and 78 of the
heavy chain, and residues 39, 41, 46 and 105 of the
light chain, with maintenance of antigen binding
activity.

In a most preferred embodiment, the heavy
chain variable region contains a FR derived from human
antibody KOL in which murine monoclonal antibody TF8-5G9
residues are retained at amino acids 23, 24, 28, 29, 30,
48, 49, 71, 88 and 91. The preferred heavy chain
variable region is designated TF8HCDR20 and has the
following sequence.

10 20 30 35ab 50

QVQLVESGGG WQPGRSLRL SCKASGFNIK DYYMH —WVR QAPGKGLEWIGL 

52abc 60 70 80 82abc 90 100 

I DP ENGNTIYD PKFQG RFTIS ADNSKNTLFL QMDSLRPEDTAVY YCA RDNSYYF 

110

DYWGQGTPVT VSS (SEQ ID NO: 13) 
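
The relationship between the murine FR positions retained in TF8HCDR1 and those retained in TF8HCDR20 can be checked with simple set arithmetic. The sketch below uses only the Kabat positions stated in the text; the variable names are illustrative.

```python
# Murine FR residues retained in TF8HCDR1 (Kabat numbering, from the text).
tf8hcdr1_murine = {6, 17, 23, 24, 28, 29, 30, 48, 49, 68, 71, 73, 78, 88, 91}

# Heavy chain positions at which the human FR residue may be kept instead.
reverted_to_human = {6, 17, 68, 73, 78}

# Murine FR residues retained in the preferred heavy chain TF8HCDR20.
tf8hcdr20_murine = {23, 24, 28, 29, 30, 48, 49, 71, 88, 91}

# Removing the further-humanized positions from TF8HCDR1 should leave
# exactly the positions listed for TF8HCDR20.
remaining = tf8hcdr1_murine - reverted_to_human
print(sorted(remaining))
print(remaining == tf8hcdr20_murine)
```

The heavy chain lists are consistent under this check: TF8HCDR20 differs from TF8HCDR1 in the FR by reverting five murine positions to the human KOL residue.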

In a most preferred embodiment, the light
chain variable region contains a FR derived from human
antibody REI in which murine monoclonal antibody TF8-5G9
residues are retained at amino acids 39 and 105. The
preferred light chain variable region is designated
TF8LCDR20 and has the following sequence.

10 20 30 40 50 

DIQMTQSPSS LSASVGDRVT IT CKASQDIR KYLNW YQQKP GKAPKLLIYY

60 70 80 90 100 

ATSLAD GVPS RFSGSGSGTD YTFTISSLQP EDIATYY CLQ HGESPYT FGQ 
GTKLEITR (SEQ ID NO: 14) 
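
As a rough illustration of locating the CDRs within a variable region, the sketch below finds the three light chain CDRs (SEQ ID NOS:8-10) in the light chain sequence shown above by exact string search. This is a simplification and not the Kabat alignment procedure itself. The sequence is transcribed from SEQ ID NO: 14 with spacing removed, the function name locate_cdrs is hypothetical, and the reported positions are sequential residue numbers, which match Kabat numbering only on the assumption that this light chain uses no insertion positions.

```python
# Light chain variable region, transcribed from SEQ ID NO: 14 (spaces removed).
LIGHT_CHAIN_V = (
    "DIQMTQSPSSLSASVGDRVTITCKASQDIRKYLNWYQQKPGKAPKLLIYY"
    "ATSLADGVPSRFSGSGSGTDYTFTISSLQPEDIATYYCLQHGESPYTFGQ"
    "GTKLEITR"
)

# Preferred light chain CDRs (SEQ ID NOS:8-10).
LIGHT_CDRS = {
    "CDR1": "KASQDIRKYLN",
    "CDR2": "YATSLAD",
    "CDR3": "LQHGESPYT",
}


def locate_cdrs(variable_region: str, cdrs: dict) -> dict:
    """Return 1-based (start, end) positions of each CDR by exact match."""
    positions = {}
    for name, sequence in cdrs.items():
        start = variable_region.find(sequence)
        if start < 0:
            raise ValueError(f"{name} not found in variable region")
        positions[name] = (start + 1, start + len(sequence))
    return positions


for name, (start, end) in locate_cdrs(LIGHT_CHAIN_V, LIGHT_CDRS).items():
    print(name, start, end)
```

Exact matching only works when the CDRs are grafted unmodified; a modified CDR would require an alignment-based approach.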

It is within the ken of the ordinarily skilled
artisan to make minor modifications of the foregoing
sequences, including amino acid substitutions, deletions
and insertions. Any such modifications are within the
scope of the present invention so long as the resulting
CDR-grafted antibody maintains the ability to bind to
and inhibit human tissue factor. The ordinarily skilled
artisan can assess the activity of the CDR-grafted
antibody with reference to the functional assays
described hereinbelow.

The human constant region of the CDR-grafted
antibodies of the present invention is selected to
minimize effector function. The intended use of the
CDR-grafted antibodies of the present invention is to
block the coagulation cascade by inhibition of tissue
factor, and thus antibody effector functions such as
fixation of complement are not desirable. Antibodies
with minimal effector functions include IgG2, IgG4, IgA,
IgD and IgE. In a preferred embodiment of the present
invention, the heavy chain constant region is the human
IgG4 constant region, and the light chain constant
region is the human IgG4 kappa constant region.

In that effector functions may not be
desirable for therapeutic uses, the present invention
further contemplates active fragments of the CDR-grafted
antibodies, and in particular Fab fragments and F(ab')2
fragments. Active fragments are those fragments capable
of inhibiting human tissue factor. Fab fragments and
F(ab')2 fragments may be obtained by conventional means,
for example by cleavage of the CDR-grafted antibodies of
the invention with an appropriate proteolytic enzyme
such as papain or pepsin, or by recombinant production.
The active fragments maintain the antigen binding sites
of the CDR-grafted antibodies and thus are similarly
useful therapeutically.

The ability of the CDR-grafted antibodies
designed and constructed as taught in accordance with
the present invention to bind and inhibit human tissue
factor can be assessed by functional assays. For
example, in a rapid and convenient assay, expression
vectors containing nucleic acids encoding the
CDR-grafted heavy and light chains can be co-transfected
into suitable host cells and transiently expressed. The
resulting antibodies can be assessed by standard assays
for ability to bind human tissue factor, and for ability
to compete for binding to tissue factor with the
non-human antibody from which the CDRs are derived.

For example, transient expression of nucleic
acids encoding the CDR-grafted heavy and light chains in
COS cells provides a rapid and convenient system to test
antibody gene expression and function. Nucleic acids
encoding the CDR-grafted heavy and light chains,
respectively, are cloned into a mammalian cell
expression vector, for example pSG5, described by Green
et al. (1988) Nucleic Acids Res. 16:369 and commercially
available from Stratagene Cloning Systems, La Jolla, CA.
The pSG5 expression vector provides unique restriction
sites for the insertion of the heavy and light chain
genes, and in vivo expression is under the control of
the SV40 early promoter. Transcriptional termination is
signaled by the SV40 polyadenylation signal sequence.

The pSG5-based expression vectors containing
nucleic acids encoding the heavy and light chains are
cotransfected into COS cells and cultured under
conditions suitable for transient expression. Cell
culture media is then harvested and examined for
antibody expression, for example by an enzyme linked
immunosorbent assay (ELISA), to determine that suitable
levels of antibody have been produced. An ELISA may
then be used to assess the ability of the CDR-grafted
antibody to bind to human tissue factor. Human tissue
factor is immobilized on a microtiter plate and the COS
cell supernatant containing the CDR-grafted antibody is
added followed by an incubation at room temperature for
about one hour. The plates are then washed with a
suitable detergent-containing buffer such as phosphate
buffered saline (PBS)/Tween, followed by the addition of
the components of a suitable detection system. For
example, horseradish peroxidase conjugated goat
anti-human kappa chain polyclonal antibody is added, followed
by washing, followed by addition of substrate for
horseradish peroxidase, and detection. The CDR-grafted
antibodies within the scope of the present invention are
those which are capable of binding to human tissue
factor to a degree comparable to the non-human antibody
from which the CDRs are derived as determined by the
foregoing assay.

The ability of the CDR-grafted antibodies to
inhibit the activity of human tissue factor in vivo can
be conveniently assessed by the following in vitro assay
that mimics in vivo coagulation events. In response to
vascular injury in vivo, tissue factor binds to factor
VII and facilitates the conversion of factor VII to a
serine protease (factor VIIa). The factor VIIa-tissue
factor complex converts factor X to a serine protease
(factor Xa). Factor Xa forms a complex with factor Va
(from the intrinsic coagulation pathway), resulting in
the conversion of prothrombin to thrombin, which in turn
results in the conversion of fibrinogen to fibrin. In a
convenient in vitro functional assay, tissue factor is
incubated in the presence of factor VIIa and the
CDR-grafted anti-tissue factor antibody produced in the
transient expression system described above. Factor X
is added and the reaction mixture is incubated, followed
by an assay for factor Xa activity utilizing a
chromogenic substrate for factor Xa (Spectrozyme FXa,
American Diagnostica, Inc., Greenwich, CT). The ability
of the CDR-grafted antibody to inhibit factor X
activation thus provides a measure of the ability of the
CDR-grafted antibody to inhibit the activity of human
tissue factor.
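
One way the chromogenic readout described above could be reduced to a single inhibition figure is sketched below. This is an illustrative calculation only. The formula (blank-corrected signal relative to an antibody-free control), the function name percent_inhibition, and the example readings are assumptions and are not taken from the disclosure.

```python
def percent_inhibition(signal_with_antibody: float,
                       signal_no_antibody: float,
                       signal_blank: float = 0.0) -> float:
    """Percent inhibition of factor X activation (assumed formula).

    signal_with_antibody: factor Xa chromogenic signal with antibody present.
    signal_no_antibody:   signal from the antibody-free control reaction.
    signal_blank:         background signal (for example, no tissue factor).
    """
    control = signal_no_antibody - signal_blank
    if control <= 0:
        raise ValueError("control signal must exceed the blank")
    sample = signal_with_antibody - signal_blank
    return 100.0 * (1.0 - sample / control)


# Hypothetical absorbance readings in arbitrary units.
inhibition = percent_inhibition(0.25, 1.05, 0.05)
print(f"{inhibition:.1f}% inhibition of factor X activation")
```

A full comparison would normally use a dilution series of each antibody rather than a single reading.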

The CDR-grafted antibodies within the scope of
the present invention are those which are capable of
inhibiting human tissue factor to a degree comparable to
the non-human antibody from which the CDRs are derived
as determined by the foregoing assay. In one
embodiment, the CDR-grafted antibody has at least 50% of
the inhibitory activity of TF8-5G9 for human tissue
factor. In a preferred embodiment, the CDR-grafted
antibody has at least 70% of the inhibitory activity of
TF8-5G9 for human tissue factor. In a more preferred
embodiment, the CDR-grafted antibody has at least 80% of
the inhibitory activity of TF8-5G9 for human tissue
factor. In a most preferred embodiment, the CDR-grafted
antibody has at least 90% of the inhibitory activity of
TF8-5G9 for human tissue factor.
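
The activity thresholds above can be expressed as a simple classification. The sketch below assumes that inhibitory activity has already been measured for both the CDR-grafted antibody and TF8-5G9 under the same assay conditions; how those two numbers are obtained is outside the sketch. The function names relative_activity and embodiment_tier are hypothetical.

```python
def relative_activity(candidate_inhibition: float,
                      tf8_5g9_inhibition: float) -> float:
    """Inhibitory activity of a candidate as a percent of that of TF8-5G9."""
    if tf8_5g9_inhibition <= 0:
        raise ValueError("reference activity must be positive")
    return 100.0 * candidate_inhibition / tf8_5g9_inhibition


def embodiment_tier(relative_pct: float) -> str:
    """Map relative activity onto the tiers recited in the text."""
    tiers = [
        (90.0, "most preferred"),
        (80.0, "more preferred"),
        (70.0, "preferred"),
        (50.0, "one embodiment"),
    ]
    for threshold, label in tiers:
        if relative_pct >= threshold:
            return label
    return "below the 50% threshold"


# Hypothetical measurements: candidate 72 units, TF8-5G9 80 units.
print(embodiment_tier(relative_activity(72.0, 80.0)))
```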

In another embodiment, the present invention
provides a method of producing a CDR-grafted antibody
capable of inhibiting human tissue factor. The method
comprises constructing an expression vector containing a
nucleic acid encoding the CDR-grafted antibody heavy
chain and an expression vector containing a nucleic acid
encoding the CDR-grafted antibody light chain,
transfecting suitable host cells with the expression
vectors, culturing the transfected host cells under
conditions suitable for the expression of the heavy and
light chains, and recovering the CDR-grafted antibody.
Alternately, one expression vector containing nucleic
acids encoding the heavy and light chains may be
utilized.

Standard molecular biological techniques, for
example as disclosed by Sambrook et al. (1989),
Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Press, Cold Spring Harbor, NY, may be used to
obtain nucleic acids encoding the heavy and light chains
of the CDR-grafted antibodies of the present invention.
A nucleic acid encoding the CDR-grafted variable domain
may be constructed by isolating cDNA encoding the
antibody to be humanized, e.g. murine monoclonal
antibody TF8-5G9, by conventional cloning methodology
from the hybridoma producing the antibody, or by
polymerase chain reaction (PCR) amplification of the
variable region genes, as described for example by
Winter et al., followed by site-directed mutagenesis to
substitute nucleotides encoding the desired human
residues into the FR regions. Alternately, the cDNA
encoding the human antibody can be isolated, followed by
site-directed mutagenesis to substitute nucleotides
encoding the desired murine residues into the CDRs.

Nucleic acids encoding the CDR-grafted
variable domain may also be synthesized by assembling
synthetic oligonucleotides, for example utilizing DNA
polymerase and DNA ligase. The resulting synthetic
variable regions may then be amplified by PCR. Nucleic
acids encoding CDR-grafted variable domains may also be
constructed by PCR strand overlap methods that are known
in the art and reviewed by Owens et al.

Accordingly, having determined the desired
amino acid sequences of the CDR-grafted variable domains
in accordance with the present invention, the ordinarily
skilled artisan can obtain nucleic acids encoding the
variable domains. Further, the skilled artisan is aware
that due to the degeneracy of the genetic code, various
nucleic acid sequences can be constructed that encode
the CDR-grafted variable domains. All such nucleic acid
sequences are contemplated by the present invention.
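
By way of illustration only, the extent of this degeneracy can be
estimated by multiplying together the number of codons available for
each amino acid of a polypeptide. The following Python sketch assumes
the standard genetic code; the function name count_encodings and the
short example peptides are hypothetical and are not taken from the
sequences disclosed herein.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (one-letter amino acid codes; the 61 sense codons only).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_encodings(peptide: str) -> int:
    """Return how many distinct DNA sequences encode the given peptide."""
    return prod(CODON_COUNTS[aa] for aa in peptide.upper())


# Hypothetical example peptides, chosen only to show the arithmetic.
print(count_encodings("MW"))   # 1  (Met and Trp each have a single codon)
print(count_encodings("RSL"))  # 216  (6 x 6 x 6)
```

Because the count grows multiplicatively with chain length, a variable
domain of more than one hundred residues can be encoded by a very large
number of distinct nucleic acid sequences.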

The nucleic acids encoding the CDR-grafted
variable domains are linked to appropriate nucleic acids
encoding the human antibody heavy or light chain
constant region. Nucleic acid sequences encoding human
heavy and light chain constant regions are known in the
art. It is within the ken of the ordinarily skilled
artisan to include sequences that facilitate
transcription, translation and secretion, for example
start codons, leader sequences, the Kozak consensus
sequence (Kozak, 1987, J. Mol. Biol. 196:947) and the
like, as well as restriction endonuclease sites to
facilitate cloning into expression vectors.

The present invention thus further provides
nucleic acids encoding the heavy and light chains of
CDR-grafted antibodies capable of inhibiting human
tissue factor wherein the CDRs are derived from a murine
monoclonal antibody against tissue factor and the FR and
C regions are derived from one or more human antibodies.

In accordance with the present invention,
representative nucleic acids encoding CDR-grafted heavy
and light chains were constructed. The CDR-grafted
heavy chain comprises a variable region containing FR
regions derived from human antibody KOL and CDRs derived
from murine monoclonal antibody TF8-5G9 and further
comprises a constant region derived from the heavy chain
of human IgG4. The CDR-grafted light chain comprises a
variable region containing FR regions derived from human
antibody REI and CDRs derived from murine monoclonal
antibody TF8-5G9 and further comprises a constant region
derived from human IgG4 kappa chain. Nucleic acids
encoding the heavy and light chains were constructed by
assembling the variable regions from synthetic
oligonucleotides, amplifying the assembled variable regions
by PCR, purifying the amplified nucleic acids, and
ligating the nucleic acid encoding the variable region
into a vector containing a nucleic acid encoding the
appropriate human constant region.

The sequences of representative nucleic acids
encoding CDR-grafted heavy and light chains are
presented as nucleotides 1-2360 of SEQ ID NO: 15 and
nucleotides 1-759 of SEQ ID NO: 20, respectively.

The nucleic acid sequence encoding a preferred
heavy chain (nucleotides 1-2360 of SEQ ID NO: 15) is
designated the TF8HCDR20 gene. The nucleic acid
sequence contains the following regions: 5' EcoRI
restriction site (nucleotides 1-6); Kozak sequence
(nucleotides 7-15); start codon and leader sequence
(nucleotides 16-72); CDR-grafted variable region
(nucleotides 73-423); human IgG4 CH1 domain (nucleotides
424-717); human IgG4 intron 2 (nucleotides 718-1110);
human IgG4 hinge (nucleotides 1111-1146); human IgG4
intron 3 (nucleotides 1147-1267); human IgG4 CH2 domain
(nucleotides 1268-1594); human IgG4 intron 4
(nucleotides 1595-1691); human IgG4 CH3 domain
(nucleotides 1692-2012); 3' untranslated region
(nucleotides 2013-2354); 3' BamHI end spliced to BclI
site of expression vector (nucleotides 2355-2360).
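
The region coordinates recited above for the TF8HCDR20 gene may be
checked for internal consistency. The following Python sketch records
the coordinates as a simple list and verifies that the regions tile
nucleotides 1-2360 without gaps or overlaps. The names
HEAVY_CHAIN_REGIONS, is_contiguous and region_at are hypothetical
helpers introduced only for this illustration.

```python
# (start, end, description), 1-based inclusive, as recited for TF8HCDR20.
HEAVY_CHAIN_REGIONS = [
    (1, 6, "5' EcoRI restriction site"),
    (7, 15, "Kozak sequence"),
    (16, 72, "start codon and leader sequence"),
    (73, 423, "CDR-grafted variable region"),
    (424, 717, "human IgG4 CH1 domain"),
    (718, 1110, "human IgG4 intron 2"),
    (1111, 1146, "human IgG4 hinge"),
    (1147, 1267, "human IgG4 intron 3"),
    (1268, 1594, "human IgG4 CH2 domain"),
    (1595, 1691, "human IgG4 intron 4"),
    (1692, 2012, "human IgG4 CH3 domain"),
    (2013, 2354, "3' untranslated region"),
    (2355, 2360, "3' BamHI end spliced to BclI site"),
]


def is_contiguous(regions, total_length):
    """True if regions cover 1..total_length with no gaps or overlaps."""
    expected_start = 1
    for start, end, _name in regions:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


def region_at(regions, position):
    """Return the description of the region containing a 1-based position."""
    for start, end, name in regions:
        if start <= position <= end:
            return name
    return None


print(is_contiguous(HEAVY_CHAIN_REGIONS, 2360))  # True
print(region_at(HEAVY_CHAIN_REGIONS, 100))       # CDR-grafted variable region
```

The same check can be applied to the light chain coordinates by
substituting the corresponding list and a total length of 759.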

The nucleic acid sequence encoding a preferred
light chain gene (nucleotides 1-759 of SEQ ID NO: 20) is
designated the TF8LCDR3 gene. The nucleic acid sequence
contains the following regions: 5' EcoRI restriction
site (nucleotides 1-5); Kozak sequence (nucleotides 6-8);
start codon and leader sequence (nucleotides 9-68);
CDR-grafted variable region (nucleotides 69-392); human
kappa constant region (nucleotides 393-710); 3'
untranslated region (nucleotides 711-753); 3' BamHI end
spliced to BclI site of expression vector (nucleotides
754-759).

The foregoing preferred sequences can be
modified by the ordinarily skilled artisan to take into
account degeneracy of the genetic code, and to make
additions, deletions, and conservative and
nonconservative substitutions that result in a
maintenance of the function of the nucleic acid, i.e.
that it encodes a heavy or light chain of a CDR-grafted
antibody capable of inhibiting human tissue factor.
Restriction sites and sequences that facilitate
transcription and translation may be altered or
substituted as necessary depending upon the vector and
host system chosen for expression.

Suitable expression vectors and hosts for
production of the CDR-grafted antibodies of the present
invention are known to the ordinarily skilled artisan.
The expression vectors contain regulatory sequences,
such as replicons and promoters, capable of directing
replication and expression of heterologous nucleic acid
sequences in a particular host cell. The vectors may
also contain selection genes, enhancers, signal
sequences, ribosome binding sites, RNA splice sites,
polyadenylation sites, transcriptional terminator
sequences, and so on. The vectors may be constructed by
conventional methods well-known in the art, or obtained
from commercial sources. The expression vectors
preferably have convenient restriction sites at which
the nucleic acids encoding the antibody chains of the
invention are inserted. Myeloma expression vectors in
which antibody gene expression is driven by the human
cytomegalovirus promoter-enhancer are particularly
preferred.

Expression vectors containing a nucleic acid
encoding the CDR-grafted heavy chain under the control
of a suitable promoter and expression vectors containing
a nucleic acid encoding the CDR-grafted light chain
under the control of a suitable promoter are
cotransfected into a suitable host cell. In another
embodiment, nucleic acids encoding both heavy and light
chains are provided in a single vector for transfection
of a suitable host cell.

Suitable host cells or cell lines for
expression of the CDR-grafted antibodies of the present
invention include bacterial cells, yeast cells, insect
cells, and mammalian cells such as Chinese hamster ovary
(CHO) cells, COS cells, fibroblast cells and myeloid
cells. Mammalian cells are preferred. CHO, COS and
myeloma cells are particularly preferred. Myeloma cells
are preferred for establishing permanent CDR-grafted
antibody producing cell lines. Expression of antibodies
in myeloma cells, bacteria, and yeast is reviewed by
Sandhu (1992) Critical Reviews in Biotechnology 12:437.
Expression in mammalian cells is reviewed by Owens et al.

Transfection of host cells by the expression
vectors containing nucleic acids encoding the
CDR-grafted heavy and light chains can be accomplished by
methods well-known to one of ordinary skill in the art.
Such methods include, for example, calcium chloride
transfection, calcium phosphate transfection,
lipofection and electroporation. Suitable culture
methods and conditions for the production of the
CDR-grafted antibodies are likewise well-known in the art.
The CDR-grafted antibodies can be purified by
conventional methods, including ammonium sulfate
precipitation, affinity chromatography, gel
electrophoresis, and the like. The ability of the
CDR-grafted antibodies to bind to and inhibit human tissue
factor can be assessed by the in vitro assays described
above.

The CDR-grafted antibodies of the present
invention have a variety of utilities. For example, the
antibodies are capable of binding to human tissue factor
and thus are useful in assays for human tissue factor
from body fluid samples, purification of human tissue
factor, and so on.

The CDR-grafted antibodies of the present
invention are capable of inhibiting human tissue factor.
Human tissue factor is well-known to be an essential
element in the human coagulation cascade. The ability
of the antibodies of the present invention to disrupt
the coagulation cascade is demonstrated by in vitro
assays in which the antibodies prevent factor X
activation. Accordingly, the present antibodies are
useful in the attenuation of coagulation. The present
invention thus provides a method of attenuation of
coagulation comprising administering a therapeutically
effective amount of a CDR-grafted antibody capable of
inhibiting human tissue factor to a patient in need of
such attenuation.

Numerous thrombotic disorders are
characterized by excessive or inappropriate coagulation
and are effectively treated or prevented by
administration of agents that interfere with the
coagulation cascade. Accordingly, the present invention
further provides a method of treatment or prevention of
a thrombotic disorder comprising administering a
therapeutically effective amount of a CDR-grafted
antibody capable of inhibiting human tissue factor to a
patient in need of such treatment or prevention. In a
preferred embodiment, the thrombotic disorder is
intravascular coagulation, arterial restenosis or
arteriosclerosis. The antibodies of the invention may be
used in combination with other antibodies or therapeutic
agents.

A therapeutically effective amount of the
antibodies of the present invention can be determined by
the ordinarily skilled artisan with regard to the
patient's condition, the condition being treated, the
method of administration, and so on. A therapeutically
effective amount is the dosage necessary to alleviate,
eliminate, or prevent the thrombotic disorder as
assessed by conventional parameters. For example, a
therapeutically effective dose of a CDR-grafted antibody
of the present invention may be from about 0.1 mg to
about 20 mg per 70 kg of body weight. A preferred
dosage is about 1.0 mg to about 5 mg per 70 kg of body
weight.

A patient in need of such treatment is a
patient suffering from a disorder characterized by
inappropriate or excessive coagulation, or a patient at
risk of such a disorder. For example, anticoagulant
therapy is useful to prevent postoperative venous
thrombosis, and arterial restenosis following balloon
angioplasty.

The CDR-grafted antibodies of the present
invention are useful in the same manner as comparable
therapeutic agents, and the dosage level is of the same
order of magnitude as is generally employed with those
comparable therapeutic agents. The present antibodies
may be administered in combination with a
pharmaceutically acceptable carrier by methods known to
one of ordinary skill in the art.

Another embodiment of the present invention is
directed to a pharmaceutical composition comprising at
least one CDR-grafted antibody capable of inhibiting
human tissue factor and further comprising a
pharmaceutically acceptable carrier. As used herein,
"pharmaceutically acceptable carrier" includes any and
all solvents, dispersion media, coatings, antibacterial
and antifungal agents, isotonic and absorption delaying
agents, and the like. The use of such media and agents
for pharmaceutically active substances is well-known in
the art. Except insofar as any conventional media or
agent is incompatible with the active ingredient, its
use in the therapeutic compositions is contemplated.
Supplementary active ingredients can also be
incorporated into the compositions.

The antibodies can be administered by well-known
routes including oral and parenteral, e.g.,
intravenous, intramuscular, intranasal, intradermal,
subcutaneous, and the like. Parenteral administration
and particularly intravenous administration is
preferred. Depending on the route of administration,
the pharmaceutical composition may require protective
coatings.

The pharmaceutical forms suitable for
injectable use include sterile aqueous solutions or
dispersions and sterile powders for the extemporaneous
preparation of sterile injectable solutions or
dispersions. In all cases the ultimate solution form
must be sterile and fluid. Typical carriers include a
solvent or dispersion medium containing, for example,
water, buffered aqueous solutions (i.e., biocompatible
buffers), ethanol, polyol such as glycerol, propylene
glycol, polyethylene glycol, suitable mixtures thereof,
surfactants or vegetable oils. The antibodies may be
incorporated into liposomes for parenteral
administration. Sterilization can be accomplished by any
art-recognized technique, including but not limited to
addition of antibacterial or antifungal agents, for
example, paraben, chlorobutanol, phenol, sorbic acid or
thimerosal. Further, isotonic agents such as sugars or
sodium chloride may be incorporated in the subject
compositions.

Production of sterile injectable solutions
containing the subject antibodies is accomplished by
incorporating these antibodies in the required amount in
the appropriate solvent with various ingredients
enumerated above, as required, followed by
sterilization, preferably filter sterilization. To
obtain a sterile powder, the above solutions are
vacuum-dried or freeze-dried as necessary.

The following examples further illustrate the 
present invention. 

EXAMPLE 1

Isolation and Sequencing of TF8-5G9
Light Chain (LC) and Heavy Chain (HC)



Two DNA libraries were generated from
oligo(dT)-primed TF8-5G9 hybridoma RNA utilizing standard
molecular biology procedures as described by Sambrook et
al. The cDNA was cloned into the Librarian II plasmid
vector from Invitrogen (San Diego, CA), and the
libraries were screened for cDNA clones encoding murine
IgG HC and LC. A full-length cDNA clone for the heavy
chain could not be isolated, despite the construction of
two independent libraries. A random primed TF8-5G9 cDNA
library was generated to obtain the missing 5' sequence
of the heavy chain. Consequently, the heavy chain cDNA
was in two pieces: a 5' clone of 390 nucleotides and a
3' clone of 1392 nucleotides. The two HC clones overlap
by 292 nucleotides.
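
As a worked illustration of the clone lengths given above, two
fragments that share an overlap can be joined in silico once the
overlap is confirmed to be identical in both. The following Python
sketch is an assumption-laden illustration: the helper names and the
short toy sequences are hypothetical, and the combined length is simply
the arithmetic consequence of the stated clone lengths and overlap.

```python
def merged_length(len_5prime: int, len_3prime: int, overlap: int) -> int:
    """Length of the sequence spanned by two clones sharing an overlap."""
    return len_5prime + len_3prime - overlap


def merge_overlapping(five_prime: str, three_prime: str, overlap: int) -> str:
    """Join two sequences whose last/first `overlap` bases must be identical."""
    if overlap <= 0 or overlap > min(len(five_prime), len(three_prime)):
        raise ValueError("overlap must be positive and fit within both clones")
    if five_prime[-overlap:] != three_prime[:overlap]:
        raise ValueError("clones disagree within the overlap region")
    return five_prime + three_prime[overlap:]


# Clone lengths and overlap as stated in this Example.
print(merged_length(390, 1392, 292))  # 1490

# Toy sequences (hypothetical) sharing a 3-base overlap "TAC".
print(merge_overlapping("ACGTAC", "TACGGA", 3))  # ACGTACGGA
```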

The HC and LC clones were completely sequenced
by the dideoxy chain termination method of Sanger et al.
(1977) Proc. Natl. Acad. Sci. USA 74:5463. To verify
the variable region sequence, sequence was obtained from
PCR-amplified cDNA that had been synthesized from total
TF8-5G9 hybridoma RNA. Total TF8-5G9 hybridoma RNA was
isolated by the guanidinium thiocyanate method of
Chirgwin et al. (1979) Biochemistry 18:5294. cDNA was
synthesized using the Perkin Elmer (Norwalk, CT) GeneAmp
RNA Polymerase Chain Reaction (PCR) kit with an oligo(dT)
primer. Components of the same kit were used in
the PCR to amplify the LC and HC variable regions using
primers based on the sequence that had been obtained for
the cDNA clones. The amplified variable region
fragments were gel-purified and sequenced according to
the method of Tracy et al. (1991) BioTechniques 11:68 on
a Model 373A Applied Biosystems, Inc. (Foster City, CA)
automated fluorescent DNA sequencer. The sequence for
TF8-5G9 LC and HC obtained from RNA amplification and
the sequence obtained from the cDNA clones agreed. The
TF8-5G9 HC variable region sequence with protein
translation is shown in Figure 1 and SEQ ID NO: 1, and
that for the LC is shown in Figure 2 and SEQ ID NO: 3.

EXAMPLE 2

Chimeric LC and HC Expression Vector Construction

In order to test the binding activity of the
CDR-grafted anti-TF LC and HC individually, mouse-human
chimeric TF8-5G9 LC and HC were constructed. This
allowed the CDR-grafted LC to be tested for TF binding
ability in combination with the chimeric HC, and the
CDR-grafted HC to be tested in combination with the
chimeric LC.

Primers were designed to amplify the TF8-5G9
LC variable region using as template cDNA clones in the
Librarian II vector. The 5' primer was designed with an
EcoRI site while the 3' primer was designed with a NarI
site. PCR was used to amplify the LC variable region,
generating a 433 bp fragment with a 5' EcoRI end and a
3' NarI end. The fragment included the signal sequence
from the TF8-5G9 LC cDNA clone but incorporated a 2 base
change in the arginine codon immediately following the
ATG start codon. This change retained the arginine
residue but made the sequence conform to the Kozak
consensus sequence in order to potentially improve
translation of the LC mRNA. The PCR-amplified LC
variable region fragment was digested with EcoRI and
NarI restriction enzymes and purified by electrophoresis
on a 2% Nusieve, 1% Seakem agarose gel (FMC Bio
Products, Rockland, ME).
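
That a two-base change can leave an arginine codon encoding arginine
follows from the six-fold degeneracy of arginine in the standard
genetic code. The following Python sketch lists, for a given arginine
codon, the other arginine codons that differ from it at exactly two
positions. The codon actually used in the TF8-5G9 construct is not
stated here, so the codon "AGA" below is purely a hypothetical example,
as are the helper names.

```python
# The six arginine codons of the standard genetic code (DNA alphabet).
ARG_CODONS = ("CGT", "CGC", "CGA", "CGG", "AGA", "AGG")


def base_differences(codon_a: str, codon_b: str) -> int:
    """Count positions at which two codons differ."""
    return sum(x != y for x, y in zip(codon_a, codon_b))


def synonymous_swaps(codon: str, n_changes: int):
    """Arginine codons differing from `codon` at exactly n_changes positions."""
    return [c for c in ARG_CODONS if base_differences(codon, c) == n_changes]


# Hypothetical starting codon; not taken from the disclosed sequence.
print(synonymous_swaps("AGA", 2))  # ['CGT', 'CGC', 'CGG']
```

Any of the returned codons would preserve the arginine residue while
altering two bases of the sequence flanking the start codon.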

The DNA was extracted from the gel slice and
purified by the Geneclean (Bio 101, La Jolla, CA)
procedure. The full-length chimeric TF8-5G9 LC gene was
generated by cloning this DNA into the EcoRI and NarI
sites of a pSP73 vector (Promega, Madison, WI) which
contains the human kappa constant region. The gene was
isolated from the pSP73 vector by EcoRI digestion and
subcloned into the EcoRI site of the pSG5 mammalian cell
expression vector (Stratagene Cloning Systems, La Jolla,
CA).

The chimeric TF8-5G9 HC gene was assembled in
a manner similar to that of the chimeric LC. Since
there was no full-length HC cDNA isolated from the
Librarian II vector cDNA libraries, the HC variable
region fragment that was generated by the PCR from total
TF8-5G9 hybridoma cell RNA was used as the template.
Primers which incorporated an EcoRI site at the 5' end
and a SacI site at the 3' end were used in the PCR to
generate a 430 bp fragment which contained the TF8-5G9
HC Kozak sequence, start codon, signal sequence, and
variable region. This fragment was digested with the
restriction enzymes EcoRI and SacI, and gel-purified
using the same procedure that was used with the chimeric
LC construction.

The full-length TF8-5G9 chimeric HC gene was
constructed by cloning the variable region fragment into
the EcoRI and SacI sites of the pSG5 expression vector
containing the human IgG4 constant region.

EXAMPLE 3

Design and Construction of the
CDR-Grafted Heavy and Light Chain Genes

The variable region domains of the CDR-grafted
HC and LC genes were designed with an EcoRI overhang at
the 5' end followed by a Kozak sequence to improve
antibody expression. The leader sequences were derived
from the heavy and light chains of the murine monoclonal
antibody B72.3 (Whittle et al. (1987) Protein
Engineering 1:499). The 3' ends of the variable regions
were designed to have overhangs which allowed for
splicing to the appropriate human constant region DNA.

In the initially designed CDR-grafted TF8-5G9
heavy and light chains the CDRs were derived from murine
TF8-5G9 sequence while the frameworks were derived
primarily from human antibody sequence. The human
antibody KOL (Schmidt et al.) was used for the heavy
chain frameworks, while the human antibody REI dimer (Epp et
al.) was used for the light chain frameworks.

Several criteria were used to select murine
framework residues in the design of the TF8-5G9
CDR-grafted heavy and light chain variable regions.
Framework residues which, at a particular position, are
idiosyncratic to TF8-5G9 were retained as murine
sequence with the assumption that they contributed to
its unique binding characteristics. TF8-5G9 murine
residues were also retained at framework positions where
they were in agreement with the human consensus sequence
but where the corresponding residues in KOL or REI were
idiosyncratic. Residues that are part of antibody loop
canonical structures such as residue 71 (numbering



WO 96/40921    PCT/US96/09287



according to Kabat et al.) of the heavy and light chains
were also retained as murine sequence. Framework
residues that form loops such as residues 26-30 of the
HC were kept as TF8-5G9 murine sequence at positions
where the murine sequence differed from the human.
Residues known to directly influence the conformation of
CDRs, such as 48 and 49 immediately preceding CDR2 of
the HC, were also retained as murine sequence.

The amino acid sequence of the variable region
for the initially designed CDR-grafted TF8-5G9 HC,
TF8HCDR1, is shown in SEQ ID NO: 11. Murine residues
were retained at framework positions 6, 17, 23, 24, 28,
29, 30, 48, 49, 68, 71, 73, 78, 88 and 91. The
CDR-grafted HC variable region was attached to a human IgG4
constant region.

The amino acid sequence of the variable region
for the initially designed CDR-grafted TF8-5G9 LC,
TF8LCDR1, is shown in SEQ ID NO: 12. Murine residues
were retained at framework positions 39, 41, 46 and 105.
The CDR-grafted LC variable region was attached to a
human kappa constant region.

The variable regions for the CDR-grafted HC and
LC described above were each assembled from 13 synthetic
oligonucleotides which were synthesized by Research
Genetics, Inc., Huntsville, AL. These oligonucleotides
ranged in length from 42 to 80 bases, and encoded both
variable region strands. When the 6 complementary
oligonucleotide pairs were annealed, the overhangs
generated were 17 to 24 bases in length. These
oligonucleotide pairs were combined, annealed at their
complementary overhangs, and ligated to give the final
full length double-stranded variable regions.

The HC variable region oligonucleotides were
assembled into a 452 bp fragment which contains a 5'
EcoRI site and a 3' SacI site. The polymerase chain
reaction was used to amplify this fragment. The
resulting amplified DNA was purified on a 2% Nusieve, 1%
Seakem agarose gel (FMC). The appropriate size band of
DNA was excised and the DNA was recovered by the
Geneclean (Bio 101) procedure. The fragment was then
digested with EcoRI and SacI, and purified again by the
Geneclean method. This HC variable region fragment with
EcoRI and SacI ends was cloned into the EcoRI and SacI
sites of the pSport-1 vector (GIBCO-BRL Life
Technologies, Gaithersburg, MD). DNA from several
clones was isolated and sequenced to verify proper
variable region assembly. All clones had unexpected
base changes. One clone with the fewest base changes
(two mismatches at bases 133 and 140) was selected to be
corrected by site-directed mutagenesis according to
Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488.
Briefly, CJ236 (ung-, dut-) competent cells (Invitrogen
Corporation, San Diego, CA) were transformed with the
pSport vector containing the CDR-grafted HC variable
region with the two base mismatch. Single-stranded,
uridine-incorporated DNA templates were purified from
phage following M13 helper phage (Stratagene Cloning
Systems) infection of the transformed cells.
Mutagenesis oligos containing the desired base changes
were synthesized on an Applied Biosystems Model 380B DNA
synthesizer. The mutagenesis oligos were annealed to
the template DNA, and T7 DNA Polymerase and T4 DNA
Ligase (MutaGene InVitro Mutagenesis Kit, Bio-Rad
Laboratories, Richmond, CA) were used to incorporate the
oligo into a newly synthesized DNA strand. DH5α
competent cells (GIBCO-BRL Life Technologies) were
transformed with the double-stranded DNA. The original
uridine-incorporated strand is destroyed while the newly
synthesized strand containing the mutagenesis oligo is
replicated. Phagemid DNA was prepared from the
resulting mutagenesis clones and the variable regions
were sequenced to identify the clones which had
incorporated the desired changes. The corrected HC
EcoRI/SacI variable region fragment was excised from the
pSport vector, purified and ligated into the EcoRI/SacI
sites of a pSG5 vector containing the human IgG4
constant region. This resulted in the generation of a
full-length humanized TF8-5G9 HC gene, TF8HCDR1, in the
pSG5 COS cell expression vector. The vector was
designated pSG5TF8HCDR1.

The CDR-grafted TF8-5G9 LC variable region was
also amplified by the PCR from the assembled synthetic
oligonucleotides into a 433 bp fragment which contained
a 5' EcoRI site and a 3' NarI site. This fragment was
purified as described above for the HC, digested with
EcoRI and NarI and purified by the Geneclean procedure.
This fragment was cloned into the EcoRI and NarI sites
of a pSG5 vector which contains the human kappa constant
region. This resulted in the generation of a
full-length humanized TF8-5G9 LC gene, TF8LCDR1, in the pSG5
COS cell expression vector. Seven clones were
sequenced, and one was found to have the desired
CDR-grafted LC sequence. The vector was designated
pSG5TF8LCDR1.

EXAMPLE 4

Expression of the CDR-Grafted
Heavy and Light Chain Genes in COS Cells

The transient expression of antibody genes in
COS-1 cells provides a rapid and convenient system to
test antibody gene expression and function. COS-1 cells
were obtained from the American Type Culture Collection
(CRL 1650) and cultured in Dulbecco's Modified Eagle
Medium (DMEM, from GIBCO BRL Life Technologies) with 10%
fetal calf serum. The pSG5TF8HCDR1 expression vector
was cotransfected into COS cells with the pSG5 chimeric
LC expression vector using the DEAE-Dextran method
followed by DMSO shock as described by Lopata et al.
(1984) Nucleic Acids Res. 14:5707. After 4 days of
culture, media was harvested from the wells and examined
for antibody expression levels.

Antibody levels were determined by an ELISA-based
assembly assay. Plates were coated with a goat
anti-human Fc specific antibody. Various dilutions of
the COS cell supernatant containing secreted antibody
were added, incubated for one hour, and washed. A
horseradish peroxidase-linked goat anti-human kappa
chain antibody was added, incubated for one hour at room
temperature, and washed. Substrate for the horseradish
peroxidase was added for detection. Antibody levels in
the COS cell media were found to be nearly undetectable
for the TF8HCDR1 x chimeric LC. Upon closer examination
of the TF8HCDR1 variable region sequence, it was found
that an unexpected base change, which had occurred
during the site-directed mutagenesis process described
in Example 3, introduced a stop codon into framework 4
of the TF8HCDR1 gene. This substitution was corrected
by site-directed mutagenesis as described above.
Thorough sequencing of the variable region confirmed
that the correction was made with no additional changes
introduced. Upon transfection of this corrected
TF8HCDR1 gene with the chimeric LC, reasonable
expression levels were obtained.
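
An inadvertent stop codon of the kind described above can be detected
in silico before transfection by scanning the reading frame of a
sequenced construct. The following Python sketch is one minimal way to
do so, under the assumption that the input is an in-frame coding
sequence whose final codon is the intended terminator. The function
name and the short test sequences are hypothetical and are not derived
from the TF8HCDR1 gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_premature_stops(cds: str):
    """Return 1-based codon positions of stop codons before the final codon.

    Assumes `cds` starts in frame; any trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    # The last complete codon is treated as the legitimate terminator.
    return [pos for pos, codon in enumerate(codons[:-1], start=1)
            if codon in STOP_CODONS]


# Hypothetical sequences: the first carries an internal TGA at codon 3.
print(find_premature_stops("ATGGCCTGAGGTTAA"))  # [3]
print(find_premature_stops("ATGGCCTGGGGTTAA"))  # []
```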

COS cells which had been co-transfected with
the CDR-grafted LC expression vector, pSG5TF8LCDR1, and
either the chimeric HC or TF8HCDR1, produced antibody at
reasonable levels. Antibody levels in COS cell
supernatants ranged from 0.5 µg to 10.0 µg per ml.

EXAMPLE 5

Binding of the CDR-Grafted TF8-5G9 to Tissue Factor

An ELISA was used to determine the ability of
the CDR-grafted TF8-5G9 antibody, TF8HCDR1 x TF8LCDR1,
to bind to tissue factor. Tissue factor was immobilized
on a microtiter plate. The test COS cell supernatant,
containing the CDR-grafted antibody, was added to the
well and incubated for one hour at room temperature.
Following three washes with PBS/Tween, a goat anti-human
kappa chain polyclonal antibody conjugated to
horseradish peroxidase was added, incubated for one hour
at room temperature and washed. Substrate for the
horseradish peroxidase was added for detection. The
positive control was the TF8-5G9 chimeric antibody. The
CDR-grafted TF8-5G9 antibody was able to bind to tissue
factor to a degree comparable to the chimeric TF8-5G9
antibody (Figure 3, solid symbols).

The ability of the humanized antibody to
compete with murine TF8-5G9 for binding to tissue factor
was also examined. Varying amounts of COS cell
supernatant containing the test CDR-grafted antibody and
a fixed amount of murine TF8-5G9 were added
simultaneously to wells coated with tissue factor.
Binding was allowed to occur for one hour at room
temperature. The wells were washed three times with
PBS/Tween. A goat anti-human kappa chain antibody
conjugated to horseradish peroxidase was added,
incubated for one hour at room temperature and washed.
Substrate for the horseradish peroxidase was added for
detection. The CDR-grafted antibody competed as well as
the chimeric antibody with murine TF8-5G9 for binding to
TF.

These data indicate that the initially
designed CDR-grafted antibody, TF8HCDR1 x TF8LCDR1, was
approximately as active as the chimeric TF8-5G9 in
binding to TF and competing with the murine antibody for
binding to TF.

EXAMPLE 6

Construction and Characterization
of Additional CDR-Grafted Heavy Chains



Upon examination of the molecular structure of
murine TF8-5G9, framework residues at positions 27, 68,
73 and 78 were found to lie on the antibody surface and
had no discernible contact with the CDRs. These
framework residues were of murine sequence in TF8HCDR1
but were changed to the human KOL sequence in various
combinations to generate a series of CDR-grafted heavy
chains with framework residue variations. The changes
were made by the process of site-directed mutagenesis as
described in Example 3. Each CDR-grafted heavy chain
version was expressed in COS cells in combination with
the CDR-grafted LC, TF8LCDR1, and tested for its ability
to bind TF and compete with murine TF8-5G9 for binding.
Every version of the CDR-grafted heavy chain in
combination with TF8LCDR1 was shown to bind TF with an
affinity comparable to chimeric TF8-5G9. Every
CDR-grafted HC in combination with TF8LCDR1 was able to
compete with murine TF8-5G9 for binding to TF to a
degree comparable to the chimeric antibody.

Changes in sequence from murine to human for 
HC framework positions 6, 7, 68, 73 and 78 did not 

25 adversely affect the antigen binding ability of the 
antibody. The CDR-grafted HC version which had human 
sequence at all of these positions, and thus was the 
most humanized HC, was TF8HCDR20. 

The complete sequence of the TF8HCDR20 gene 

30 was determined. The DNA sequence is shown as a 2360 bp 
EcoRI/BamHI insert with protein translation in the 
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pEe6TF8HCDR20 expression vector in Figure 4 and SEQ ID 
NO: 15 . 

The essential regions of the gene are as follows:

Nucleotide #    Region
1-6             5' EcoRI restriction site
7-15            Kozak sequence
16-72           Start codon and leader sequence
73-423          CDR-grafted variable region
424-717         Human IgG4 CH1 domain
718-1110        Human IgG4 intron 2
1111-1146       Human IgG4 hinge
1147-1267       Human IgG4 intron 3
1268-1594       Human IgG4 CH2 domain
1595-1691       Human IgG4 intron 4
1692-2012       Human IgG4 CH3 domain
2013-2354       3' untranslated region
2355-2360       3' BamHI end spliced to BclI
                site of the expression vector
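
The region coordinates listed above can be checked mechanically. The following is a minimal sketch, not part of the original disclosure, that assumes the coordinates are 1-based and inclusive and verifies that the regions tile the 2360 bp insert without gaps or overlaps. The names HC_GENE_REGIONS and region_lengths are illustrative only.

```python
# Region map of the TF8HCDR20 insert, transcribed from the table above.
# Coordinates are assumed to be 1-based and inclusive.
HC_GENE_REGIONS = [
    (1, 6, "5' EcoRI restriction site"),
    (7, 15, "Kozak sequence"),
    (16, 72, "Start codon and leader sequence"),
    (73, 423, "CDR-grafted variable region"),
    (424, 717, "Human IgG4 CH1 domain"),
    (718, 1110, "Human IgG4 intron 2"),
    (1111, 1146, "Human IgG4 hinge"),
    (1147, 1267, "Human IgG4 intron 3"),
    (1268, 1594, "Human IgG4 CH2 domain"),
    (1595, 1691, "Human IgG4 intron 4"),
    (1692, 2012, "Human IgG4 CH3 domain"),
    (2013, 2354, "3' untranslated region"),
    (2355, 2360, "3' BamHI end spliced to BclI site"),
]


def region_lengths(regions, total_length):
    """Return {name: length} after checking the regions tile 1..total_length."""
    expected_start = 1
    lengths = {}
    for start, end, name in regions:
        if start != expected_start:
            raise ValueError(
                f"gap or overlap before {name!r}: "
                f"expected start {expected_start}, got {start}"
            )
        if end < start:
            raise ValueError(f"region {name!r} has end before start")
        lengths[name] = end - start + 1
        expected_start = end + 1
    if expected_start - 1 != total_length:
        raise ValueError(
            f"regions end at {expected_start - 1}, expected {total_length}"
        )
    return lengths


hc_lengths = region_lengths(HC_GENE_REGIONS, 2360)
for name, length in hc_lengths.items():
    print(f"{length:5d} nt  {name}")
```

Under these assumptions the variable region spans 351 nucleotides, which is a whole number of codons (117), as would be expected for a coding segment.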
EXAMPLE 7

Construction and Characterization
of Additional CDR-Grafted Light Chains

The initially designed CDR-grafted LC, TF8LCDR1, contained four
framework residues from the murine TF8-5G9 sequence. At two of
these positions, 39 and 105, the human REI framework sequence is
unique to REI; however, the murine TF8-5G9 LC sequence is in
agreement with the human consensus sequence. The other two murine
framework residues, trp41 and thr46, are unique to TF8-5G9.
Several versions of the CDR-grafted LC were generated in which the
sequence at these four positions was changed from the murine to
the human REI in various combinations. These changes were made by
site-directed mutagenesis. Each version of the CDR-grafted LC was
expressed in COS cells in combination with the CDR-grafted HC,
TF8HCDR20, and tested for ability to bind tissue factor and
compete with murine TF8-5G9 for binding. Every version of the
CDR-grafted LC, in combination with TF8HCDR20, was shown to bind
TF with an affinity comparable to TF8-5G9. Also every CDR-grafted
LC version, in combination with TF8HCDR20, was able to compete
with murine TF8-5G9 for binding to TF in a manner comparable to
the chimeric TF8-5G9 control.

Changes in sequence from murine to human for LC framework
positions 39, 41, 46 and 105 did not adversely affect the ability
of the antibody to recognize antigen. The CDR-grafted LC of choice
was TF8LCDR3, where murine TF8-5G9 sequence was used at positions
39 and 105 because these are in agreement with the human consensus
sequence. The preferred CDR-grafted TF8-5G9 antibody is
TF8HCDR20 x TF8LCDR3.

The complete sequence of the TF8LCDR3 gene was determined and is
shown as a 759 bp EcoRI-BamHI insert with protein translation in
the pEe12TF8LCDR3 expression vector in Figure 5 and SEQ ID NO: 17.
The essential regions of the gene are as follows:

Nucleotide #    Region
1-5             5' EcoRI restriction site
6-8             Kozak sequence
9-68            Start codon and leader sequence
69-392          CDR-grafted variable region
393-710         Human kappa constant region
711-753         3' untranslated region
754-759         3' BamHI end spliced to BclI
                site of the expression vector

EXAMPLE 8

CDR-Grafted TF8-5G9 Antibody TF8HCDR20 x TF8LCDR3
Inhibits Human Tissue Factor

The binding of the CDR-grafted TF8-5G9 antibody, TF8HCDR20 x
TF8LCDR3, to TF was assessed as described in Example 5 and was
found to be comparable to that of the chimeric TF8-5G9 as
illustrated in Figure 6. The ability of the CDR-grafted TF8-5G9 to
compete with the murine antibody for binding to TF is comparable
to that of the chimeric TF8-5G9 as shown in Figure 7.

An in vitro assay was used to measure the level of inhibition of
factor X activation by the CDR-grafted TF8-5G9 antibody. In this
assay, TF forms an active proteolytic complex with factor VII.
This complex then converts factor X to factor Xa by proteolysis.
The activated Xa enzymatically cleaves a substrate, Spectrozyme
FXa, which releases a chromogen. The level of chromogen, as
detected by optical density, is an indication of factor X
activation due to TF-factor VIIa activity.

The following reaction mixtures were prepared in 12 x 75 mm
borosilicate glass tubes.

25 µl TBS (50 mM Tris, pH 7.4, 150 mM NaCl)
15 µl 20 mM CaCl2/1% bovine serum albumin (BSA)
20 µl human placental tissue factor solution (prepared by
      reconstituting one vial of Thromborel S, Curtin Matheson
      Scientific #269-338 with 4.0 ml dH2O and diluting 1:10
      in TBS)
30 µl Factor VII (Enzyme Research Labs #HFVII 1007 at
      237.66 ng/ml in TBS)
30 µl TBS or TF8-5G9 or TF8HCDR20 x TF8LCDR3 at 1.18 µg/ml
      or as indicated in Fig. 8

The reaction mixtures were incubated at 37 °C for ten minutes
before the addition of Factor X. (In some cases the reaction
mixture was preincubated for five minutes before addition of
Factor VII or antibody, followed by a ten minute incubation before
addition of Factor X.) Thirty µl of Factor X solution (Enzyme
Research Labs, DHFX 330, 247.38 µg/ml TBS) was added and the
mixture was incubated at 37 °C for three minutes. Factor X
activation was terminated by pipetting 40 µl of reaction mixture
into 160 µl of stop buffer (50 mM Tris, pH 7.4, 100 mM EDTA,
150 mM NaCl) in 96 well microtiter plates. Each tube of reaction
mixture was pipetted into three microtiter wells. Fifty µl of
Spectrozyme FXa substrate (American Diagnostica #222, 1 µM/ml TBS)
was added to each well. OD405 was read on a Molecular Devices
kinetic plate reader with readings taken every twenty seconds for
ten minutes. Factor X activity was recorded as mOD/minute, and
enzyme velocities over the linear portion of the reaction curve
were compared to determine inhibition of factor X activation by
the anti-TF antibodies.
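
The velocity comparison described above can be sketched as follows. This is a minimal illustration and not the plate reader's own software: it assumes the caller has already restricted the readings to the linear portion of the curve, fits a least-squares slope to OD405 versus time, converts it to mOD/minute, and expresses inhibition relative to the no-antibody (TBS) control. The function names and the OD traces are hypothetical.

```python
def velocity_mod_per_min(times_s, od405):
    """Least-squares slope of OD405 versus time, returned in mOD/minute."""
    n = len(times_s)
    if n != len(od405) or n < 2:
        raise ValueError("need at least two paired readings")
    mean_t = sum(times_s) / n
    mean_y = sum(od405) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_s, od405))
    slope_od_per_s = sxy / sxx
    return slope_od_per_s * 1000.0 * 60.0  # OD/s -> mOD/min


def percent_inhibition(v_antibody, v_control):
    """Inhibition of factor X activation relative to the TBS control."""
    return 100.0 * (1.0 - v_antibody / v_control)


# One reading every twenty seconds for ten minutes, as in the text.
times = list(range(0, 601, 20))

# Hypothetical, perfectly linear traces used only to exercise the code.
control_trace = [0.050 + 0.0005 * t for t in times]   # no antibody
antibody_trace = [0.050 + 0.0001 * t for t in times]  # with antibody

v_control = velocity_mod_per_min(times, control_trace)
v_antibody = velocity_mod_per_min(times, antibody_trace)
print(f"control  {v_control:.1f} mOD/min")
print(f"antibody {v_antibody:.1f} mOD/min")
print(f"inhibition {percent_inhibition(v_antibody, v_control):.1f} %")
```

With real data, the three replicate wells per tube would typically be averaged before velocities are compared; that step is omitted here for brevity.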

As shown in Figure 8, the CDR-grafted TF8-5G9 antibody is
approximately as effective as the murine TF8-5G9 in inhibiting
factor X activation. This indicates that the CDR-grafted TF8-5G9
is functionally active.

EXAMPLE 9

Construction of the CDR-Grafted Heavy
and Light Chain Myeloma Expression Vectors

For the purpose of establishing a permanent CDR-grafted
antibody-producing cell line, the TF8HCDR20 and TF8LCDR3 genes
were subcloned into myeloma cell expression vectors. The heavy
chain TF8HCDR20 was subcloned into the EcoRI and BclI sites of the
pEe6hCMV-BglII myeloma expression vector described by Stephens et
al. (1989) Nucleic Acids Res. 17:7110 to produce pEe6TF8HCDR20.
The light chain TF8LCDR3 was subcloned into the EcoRI and BclI
sites of the pEe12 myeloma expression vector to produce
pEe12TF8LCDR3. The heavy and light chain expression vectors are
illustrated in Figures 9 and 10, respectively. In both vectors
antibody gene transcription was driven by the human
cytomegalovirus (hCMV) promoter-enhancer, which lies directly 5'
to the multiple cloning site. The polyadenylation signal sequence
lies 3' to the multiple cloning site and signals the termination
of transcription. Each vector contains the β-lactamase gene to
allow for ampicillin selection in E. coli. The pEe12 vector
contains a glutamine synthetase cDNA gene under the
transcriptional control of the SV40 early promoter. Glutamine
synthetase allows for myeloma cell transfectants to be selected in
glutamine-free media. Myeloma cells are devoid of glutamine
synthetase activity and are dependent on a supply of glutamine in
the culture media. Cells which have been transfected with the
pEe12 vector, containing the glutamine synthetase gene, are able
to synthesize glutamine from glutamate and can survive in the
absence of glutamine.

The pEe6TF8HCDR20 expression vector is a 7073 bp plasmid whose DNA
sequence is shown in Figure 4 and SEQ ID NO: 15. The coding
regions of the TF8HCDR20 gene are translated. The essential
regions of this vector are described below:

1. Nucleotides #1-2360: The TF8HCDR20 CDR-grafted HC gene is
   described in Example 6. The HC gene was inserted as an
   EcoRI/BamHI fragment into the EcoRI/BclI sites of the
   pEe6hCMV-BglII vector.

2. Nucleotides #2361-2593: This region encodes the SV40 early
   gene polyadenylation signal (SV40 nucleotides 2770-2537), which
   acts as a transcriptional terminator. This fragment is flanked
   by a 5' BclI site and a 3' BamHI site. The 3' BamHI end of the
   heavy chain gene was spliced to the 5' BclI site of the
   polyadenylation signal, thus eliminating both sites.

3. Nucleotides #2594-3848: This region is a BamHI-BglI fragment
   from pBR328 (nucleotides 375-2422) but with a deletion between
   the SalI and AvaI sites (pBR328 nucleotides 651-1425) following
   the addition of a SalI linker to the AvaI site. This region
   contains the ColE1 bacterial origin of replication.

4. Nucleotides #3849-4327: This is a BglI-XmnI fragment from the
   β-lactamase gene of pSP64 (Promega Corporation, Madison, WI).
   This gene provides ampicillin resistance to bacteria
   transformed with this vector.

5. Nucleotides #4328-4885: This is an XmnI-HindIII fragment of
   the ColE1 based plasmid pCT54 described by Emtage et al. (1983)
   Proc. Natl. Acad. Sci. USA 80:3671. The HindIII site was
   converted to a BglII site by the addition of a linker following
   the addition of the hCMV promoter described below.

6. Nucleotides #4886-7022: These nucleotides encode the Pst-1m
   fragment of human cytomegalovirus (hCMV) strain AD 169
   described by Greenway et al. (1982) Gene 18:355 containing the
   region coding for the hCMV major immediate early promoter. This
   Pst-1m fragment was cloned into the HindIII site of pEe6hCMV by
   addition of oligonucleotides of the following sequence to
   either end of the fragment:

   5' GTCACCGTCCTTGACACGA 3'
   3' ACGTCAGTGGCAGGAACTGTGCTTCGA 5'

   The resulting 2100 bp fragment was inserted such that the
   promoter directed transcription towards the EcoRI site of
   pEe6hCMV. The oligonucleotide above served to recreate the
   complete 5' untranslated sequence of the hCMV-MIE gene the
   added irrelevant sequence at the very 5' end of the fragment.
   The HindIII site at the 5' end was subsequently converted to a
   BglII site by the addition of a further linker.

7. Nucleotides #7023-7073: The pSP64 polylinker with the BamHI
   and SalI sites removed.

The pEe12TF8LCDR3 expression vector is a 7864 bp plasmid whose DNA
sequence is shown in Figure 5 and SEQ ID NO: 17. The coding
regions of the TF8LCDR3 gene are translated. The essential regions
of this expression vector are described below:

1. Nucleotides #1-759: The TF8LCDR3 CDR-grafted LC gene is
   described in Example 7. The gene was inserted as an EcoRI/BamHI
   fragment into the EcoRI/BclI sites of the pEe12 expression
   vector.

2. Nucleotides #760-3284: These regions of pEe12 are identical to
   the regions encoded by nucleotides 2361-4885 of the
   pEe6TF8HCDR20 vector described above (regions #2-5).

3. Nucleotides #3285-5736: This region encodes the Chinese
   hamster ovary glutamine synthetase cDNA under the
   transcriptional control of the SV40 early promoter and followed
   by the SV40 polyadenylation and splice signals from the
   pSV2.dhfr vector described by Subramani et al. (1981) Mol.
   Cell. Biol. 1:854. The following describes the derivation of
   this region: A 1200 bp NaeI-PvuII fragment, containing a
   complete GS coding sequence, was excised from the Chinese
   hamster ovary cDNA clone λGS1.1 described by Hayward et al.
   (1986) Nucleic Acid Res. 14:999. After addition of a HindIII
   linker to the NaeI site and a BglII linker to the PvuII site
   (hence destroying the NaeI and PvuII sites), the 1200 bp
   fragment was cloned in place of DHFR sequences in pSV2.dhfr
   between the HindIII and BglII sites to form pSV2.GS. The single
   remaining PvuII site in pSV2BamGS was converted to a BamHI site
   by addition of an oligonucleotide linker to form pSV2BamGS. An
   EcoRI site in the GS cDNA was destroyed by site directed
   mutagenesis without altering the amino acid sequence in
   pSV2BamGS and the HindIII site was destroyed by filling in with
   DNA polymerase I. The 2451 bp BamHI fragment from this plasmid,
   containing the complete SV40-GS hybrid transcription unit, was
   excised and inserted at the BglII site of pEe6hCMV-BglII such
   that transcription from the SV40 early promoter proceeds
   towards the hCMV promoter.

4. Nucleotides #5737-7864: This region is identical to the hCMV
   promoter and pSP64 polylinker encoded by nucleotides 4886-7073
   of the pEe6TF8HCDR20 vector described above (regions 6 and 7).

For the purpose of ensuring that both the pEe6TF8HCDR20 and
pEe12TF8LCDR3 vectors co-transfected myeloma cells, the vectors
were joined in linear concatamers. Both the pEe6TF8HCDR20 and
pEe12TF8LCDR3 vectors were digested at the unique SalI site. The
SalI linearized pEe6TF8HCDR20 vector was phosphatased at its 5'
ends to prohibit ligation of two pEe6TF8HCDR20 vectors onto each
other. This phosphatased HC vector was ligated in a 2:1 molar
ratio to the SalI linearized pEe12TF8LCDR3. The resulting
concatamers were most likely of the following composition:

SalI              SalI              SalI              SalI
 |  pEe6TF8HCDR20  |  pEe12TF8LCDR3  |  pEe6TF8HCDR20  |
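
The 2:1 molar ratio above translates into a mass ratio once the plasmid sizes are taken into account. The sketch below is illustrative only: it assumes that the mass of double-stranded DNA is proportional to its length in base pairs, uses the 7073 bp and 7864 bp sizes stated in the text, and takes a total of 40 µg purely as an example quantity. The function name is hypothetical.

```python
HC_VECTOR_BP = 7073  # pEe6TF8HCDR20, size stated in the text
LC_VECTOR_BP = 7864  # pEe12TF8LCDR3, size stated in the text


def masses_for_molar_ratio(total_ug, hc_to_lc_molar_ratio=2.0):
    """Split a total DNA mass into HC and LC vector masses.

    Assumes mass is proportional to (moles x length in bp), i.e. the
    same average mass per base pair for both plasmids.
    """
    hc_weight = hc_to_lc_molar_ratio * HC_VECTOR_BP
    lc_weight = 1.0 * LC_VECTOR_BP
    scale = total_ug / (hc_weight + lc_weight)
    return hc_weight * scale, lc_weight * scale


hc_ug, lc_ug = masses_for_molar_ratio(40.0)
print(f"HC vector: {hc_ug:.2f} ug")
print(f"LC vector: {lc_ug:.2f} ug")
```

Because the two plasmids are of similar size, the mass ratio comes out a little below 2:1, since the light chain vector is the larger of the two.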



This concatamerized DNA was extracted with phenol and chloroform,
and precipitated with ammonium acetate and ethanol. The DNA
precipitate was resuspended in distilled water to a concentration
of 1 µg/µL and used to transfect myeloma cells.

EXAMPLE 10

Development of NSO Expression Cell Lines

Stably transformed cell lines expressing the humanized TF8-5G9
antibody were prepared by transfecting CDR-grafted heavy and light
chain expression vectors into NSO mouse myeloma cells. Selection
of transfected cells was carried out using the dominant selectable
marker gene, glutamine synthetase (GS).

The NSO mouse myeloma cell line, obtained from Celltech, Ltd., is
a subclone derived from NS-1 and does not express intracellular
light chains. These cells were cultured in Dulbecco's modified
Eagle's medium (DMEM) with added glutamine and 10% fetal bovine
serum (FBS). To prepare for transfection, the cells were harvested
in mid-log phase of the growth cycle, centrifuged for 5 minutes,
washed with phosphate buffered saline (PBS), centrifuged again,
and the cell pellet was resuspended in 2.2 mL of PBS. The final
cell concentration was 2.18 x 10^7 cells/mL. Cells were maintained
on ice during the entire procedure.

The DNA to be transfected (pEe12TF8LCDR3 x pEe6TF8HCDR20) was
prepared as a concatamer as described in Example 9. The DNA and
NSO cells were added to a 0.4 cm BioRad Gene Pulser cuvette in the
following order:

 40 µL (40 µg) DNA concatamer
320 µL double distilled water
 40 µL 10 x PBS
400 µL NSO cells (8.72 x 10^6 cells)
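
The cuvette composition listed above can be checked with simple arithmetic. The sketch below assumes, as the text states, that the cells are suspended in PBS at 2.18 x 10^7 cells/mL and that the DNA concatamer is dissolved in water; it further assumes the cell suspension is in 1 x PBS. The variable names are illustrative.

```python
# Volumes added to the cuvette, in microlitres, in the order listed.
components_ul = {
    "DNA concatamer (in water)": 40.0,
    "double distilled water": 320.0,
    "10 x PBS": 40.0,
    "NSO cells in PBS": 400.0,
}

total_ul = sum(components_ul.values())

# Number of cells delivered in the 400 uL aliquot.
cells_per_ml = 2.18e7
cells_added = cells_per_ml * components_ul["NSO cells in PBS"] / 1000.0

# Final PBS strength, counting the 10x stock and the (assumed 1x)
# PBS carried in with the cells.
pbs_units = 10.0 * components_ul["10 x PBS"] + 1.0 * components_ul["NSO cells in PBS"]
final_pbs_strength = pbs_units / total_ul

print(f"total volume: {total_ul:.0f} uL")
print(f"cells added: {cells_added:.3g}")
print(f"final PBS strength: {final_pbs_strength:.2f} x")
```

Under these assumptions the water and 10 x PBS additions balance so that the electroporation takes place in approximately 1 x PBS, and the cell count agrees with the 8.72 x 10^6 figure given in the list.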
Transfection was performed by electroporation following a protocol
provided by Celltech, Ltd. In this procedure, the cells and DNA in
PBS buffer were exposed to a brief, high voltage pulse of
electricity causing transient micropores to form on the cell
membrane. DNA transfer takes place through these openings. To
prepare for electroporation, the suspension of NSO cells and DNA
was gently mixed and incubated on ice for 5 minutes. The cuvette
was placed in a BioRad Gene Pulser and given 2 consecutive
electrical pulses at settings of 3 (capacitance) and 1.5V
(voltage). Following electroporation, the cuvette was returned to
the ice for 5 minutes. The suspension was then diluted in
prewarmed growth medium and distributed into seven 96-well plates.
Control plates containing cells electroporated without DNA were
also prepared at the same time to measure the presence of
spontaneous mutants. Plates were placed in a 37 °C incubator with
5% CO2.

Glutamine synthetase, encoded by the GS gene, is an enzyme that
converts glutamate to glutamine. NSO cells require glutamine for
growth due to inadequate levels of endogenous GS gene expression.
In the DNA concatamer, this gene is located on the pEe12TF8LCDR3
vector. Transfected cells which incorporate the GS gene become
glutamine-independent. Cells not integrating the GS gene into
their genome would remain glutamine-dependent and would not
survive in glutamine-free medium. Approximately 18 hours post
electroporation, all plates were fed with glutamine-free selection
medium and returned to the incubator until viable colonies
appeared.

Approximately 3 weeks after transfection, distinct macroscopic
colonies were observed. These were screened for expression of the
intact humanized antibody using the assembly ELISA as described in
Example 5. Tissue culture supernatants from wells containing
colonies were screened at a 1:10 dilution. Positive wells showing
activity greater than the 25 ng/mL standard were subcultured and
expanded for further analysis.

For selection of high producers, antibody production was
quantitated after a 96 hour growth period. Tissue culture flasks
were seeded with 2 x 10^5 cells/mL in 10 mL of selection medium
and incubated at 37 °C, 5% CO2 for 96 hours. At the end of that
time period, an aliquot was taken to determine cell concentration
and antibody titer. Evaluation of antibody production was
calculated as µg/mL and pg/cell/96 hours. The highest producers
from this transfection were:

Cell Line    µg/mL    pg/cell/96 hour
2B1          26.3     24.3
3E11         27.6     59.9
4G6          30.2     41.9
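
The two productivity measures in the table can be related to each other. The sketch below is an assumption-laden illustration: the text does not state how pg/cell/96 hours was derived, so the code assumes it is the antibody titer divided by the cell concentration measured at 96 hours, and back-calculates the cell density that assumption would imply. The names PRODUCERS and implied_cells_per_ml are hypothetical.

```python
# (titer in ug/mL, specific productivity in pg/cell/96 h), from the table.
PRODUCERS = {
    "2B1": (26.3, 24.3),
    "3E11": (27.6, 59.9),
    "4G6": (30.2, 41.9),
}


def implied_cells_per_ml(titer_ug_per_ml, pg_per_cell):
    """Cell density implied if pg/cell = titer / cell concentration.

    1 ug = 1e6 pg, so cells/mL = (ug/mL * 1e6) / (pg/cell).
    """
    return titer_ug_per_ml * 1e6 / pg_per_cell


for line, (titer, per_cell) in PRODUCERS.items():
    density = implied_cells_per_ml(titer, per_cell)
    print(f"{line:5s} {titer:5.1f} ug/mL  {per_cell:5.1f} pg/cell  "
          f"~{density:.2e} cells/mL (implied)")

best_by_titer = max(PRODUCERS, key=lambda k: PRODUCERS[k][0])
best_per_cell = max(PRODUCERS, key=lambda k: PRODUCERS[k][1])
print("highest titer:", best_by_titer)
print("highest per-cell productivity:", best_per_cell)
```

The point of the comparison is that the line with the highest titer need not be the line with the highest per-cell output, which is why both measures are reported.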



EXAMPLE 11

CDR-Grafted Antibody TF8HCDR20 x TF8LCDR3
Inhibits Tissue Factor In Vivo



CDR-grafted antibody TF8HCDR20 x TF8LCDR3 was compared to murine
antibody TF8-5G9 for its ability to protect rats from
experimentally induced disseminated intravascular coagulation
(DIC). In the DIC model, rats are challenged with human
thromboplastin (a crude tissue extract containing TF activity),
resulting in fibrinogen consumption and death. Pretreatment of
rats with anti-TF antibody was demonstrated to protect rats from
fibrinogen consumption and death as follows.

Human thromboplastin was prepared as described in U.S. Patent
5,223,427. Saline control or 30 µ/ml of TF8-5G9 or CDR-grafted
antibody was injected through the tail vein of rats, followed by
injection of thromboplastin equivalent to 200 ng of recombinant
TF. Clotting times were determined at T=0 and T=1 minute as a
measure of fibrinogen concentration. Clotting times are
proportional to fibrinogen concentration, with a 60 second
clotting time corresponding to an 80% reduction in fibrinogen
concentration. Clotting times of greater than 60 seconds cannot be
accurately measured and were recorded as 60 seconds.

Survivability and clotting times for three representative studies
are shown below.


Survivors

Study    Controls    TF8-5G9    CDR-grafted Ab
1        0/8         5/8        6/8
2        0/8         4/7        7/8
3        0/8         8/8        3/7

Clotting Times (seconds)
Controls

  Study #1        Study #2        Study #3
  T=0   T=1       T=0   T=1       T=0   T=1
  16    >60       18    >60       19    >60
  16    >60       18    >60       21    >60
  16    >60       18    >60       18    >60
  17    >60       18    >60       19    >60
  15    >60       16    >60       18    54
  16    >60       18    >60       18    >60
  16    >60       17    >60       18    >60
  16    >60       17    >60       18    >60

Clotting Times (seconds)
Murine TF8-5G9

  Study #1        Study #2        Study #3
  T=0   T=1       T=0   T=1       T=0   T=1
  16    36        18    34        19    28
  15    41        18    36        18    29
  15    33        18    >60       19    29
  15    31        17    >60       18    29
  15    >60       18    50        18    28
  16    >60       17    34        19    40
  16    33        17    34        19    40
  16    33        18    31        19    34
  16    >60                       19    >60

Clotting Times (seconds)
CDR-grafted TF8-5G9

  Study #1        Study #2        Study #3
  T=0   T=1       T=0   T=1       T=0   T=1
  16    >60       17    >60       21    >60
  16    >60       17    33        18    34
  16    >60       18    32        17    >60
  22    37        18    >60       20    35
  16    32        17    32        17    58
  15    >60       18    31        18    33
  16    >60       17    31        18    31
  16    >60       16    32

Twenty-three of the twenty-four control rats had clotting times of
greater than 60 seconds indicating that virtually all untreated
rats were consuming more than 80% of their fibrinogen. Both the
CDR-grafted and murine antibody treated rats had similar clotting
times at one minute of 44.5 and 40 seconds. Further, only six of
the murine antibody treated rats and nine of the CDR-grafted
antibody treated rats had clotting times in excess of 60 seconds.
Accordingly, both the murine and CDR-grafted antibodies were able
to neutralize TF and thus protect rats from fibrinogen consumption
and death.
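
The survivor counts reported for the three studies can be pooled to give an overall survival fraction per treatment group. The sketch below is an illustrative tally only, using the survivor table as printed; it does not attempt a statistical test, and the names SURVIVORS and pooled are hypothetical.

```python
# (survivors, animals) per study, transcribed from the survivor table.
SURVIVORS = {
    "Controls": [(0, 8), (0, 8), (0, 8)],
    "TF8-5G9": [(5, 8), (4, 7), (8, 8)],
    "CDR-grafted Ab": [(6, 8), (7, 8), (3, 7)],
}


def pooled(counts):
    """Sum survivors and animals across studies."""
    survived = sum(s for s, _ in counts)
    total = sum(n for _, n in counts)
    return survived, total


for group, counts in SURVIVORS.items():
    survived, total = pooled(counts)
    print(f"{group:15s} {survived:2d}/{total:2d}  "
          f"({100.0 * survived / total:.0f}% survival)")
```

Pooled this way, the murine and CDR-grafted antibody groups show closely similar survival, while no control animal survived in any study.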



SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Joliffe, Linda K.
               Zivin, Robert A.
               Pulito, Virginia L.

(ii) TITLE OF INVENTION: CDR-GRAFTED ANTI-TISSUE FACTOR
     ANTIBODIES AND METHODS OF USE THEREOF

(iii) NUMBER OF SEQUENCES: 20

(iv) CORRESPONDENCE ADDRESS:
     (A) ADDRESSEE: Scully, Scott, Murphy & Presser
     (B) STREET: 400 Garden City Plaza
     (C) CITY: Garden City
     (D) STATE: New York
     (E) COUNTRY: United States
     (F) ZIP: 11530

(v) COMPUTER READABLE FORM:
     (A) MEDIUM TYPE: Floppy disk
     (B) COMPUTER: IBM PC compatible
     (C) OPERATING SYSTEM: PC-DOS/MS-DOS
     (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
     (A) APPLICATION NUMBER:
     (B) FILING DATE: 07-JUN-1995
     (C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
     (A) NAME: DiGiglio, Frank S.
     (B) REGISTRATION NUMBER: 31,346
     (C) REFERENCE/DOCKET NUMBER: 9598

(ix) TELECOMMUNICATION INFORMATION:
     (A) TELEPHONE: (516) 742-4343
     (B) TELEFAX: (516) 742-4366
     (C) TELEX: 230 901 SANS UR

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
     (A) LENGTH: 1489 base pairs
     (B) TYPE: nucleic acid
     (C) STRANDEDNESS: double
     (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
     (A) NAME/KEY: CDS
     (B) LOCATION: 11..1391

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GGTCCTTACA ATG AAA TGC AGC TGG GTC ATC TTC TTC CTG ATG GCA GTG 49
Met Lys Cys Ser Trp Val Ile Phe Phe Leu Met Ala Val
1 5 10

GTT ACA GGG GTC AAT TCA GAG ATT CAG CTG CAG CAG TCT GGG GCT GAG 97
Val Thr Gly Val Asn Ser Glu Ile Gln Leu Gln Gln Ser Gly Ala Glu
15 20 25

CTT GTG AGG CCA GGG GCC TTA GTC AAG TTG TCC TGC AAA GCT TCT GGC 145
Leu Val Arg Pro Gly Ala Leu Val Lys Leu Ser Cys Lys Ala Ser Gly
30 35 40 45

TTC AAC ATT AAA GAC TAC TAT ATG CAC TGG GTG AAG CAG AGG CCT GAA 193
Phe Asn Ile Lys Asp Tyr Tyr Met His Trp Val Lys Gln Arg Pro Glu
50 55 60

CAG GGC CTG GAG TGG ATT GGA TTG ATT GAT CCT GAG AAT GGT AAT ACT 241
Gln Gly Leu Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr
65 70 75

ATA TAT GAC CCG AAG TTC CAG GGC AAG GCC AGT ATA ACA GCA GAC ACA 289
Ile Tyr Asp Pro Lys Phe Gln Gly Lys Ala Ser Ile Thr Ala Asp Thr
80 85 90

TCC TCC AAC ACA GCC TAC CTG CAG CTC AGC AGC CTG ACA TCT GAG GAC 337
Ser Ser Asn Thr Ala Tyr Leu Gln Leu Ser Ser Leu Thr Ser Glu Asp
95 100 105

ACT GCC GTC TAT TAC TGT GCT AGA GAT AAC TCG TAC TAC TTT GAC TAC 385
Thr Ala Val Tyr Tyr Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr
110 115 120 125

TGG GGC CAA GGC ACC ACT CTC ACA GTC TCC TCA GCC AAA ACG ACA CCC 433
Trp Gly Gln Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr Pro
130 135 140

CCA TCT GTC TAT CCA CTG GCC CCT GGA TCT GCT GCC CAA ACT AAC TCC 481
Pro Ser Val Tyr Pro Leu Ala Pro Gly Ser Ala Ala Gln Thr Asn Ser
145 150 155

ATG GTG ACC CTG GGA TGC CTG GTC AAG GGC TAT TTC CCT GAG CCA GTG 529
Met Val Thr Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro Val
160 165 170

ACA GTG ACC TGG AAC TCT GGA TCC CTG TCC AGC GGT GTG CAC ACC TTC 577
Thr Val Thr Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe
175 180 185

CCA GCT GTC CTG CAG TCT GAC CTC TAC ACT CTG AGC AGC TCA GTG ACT 625
Pro Ala Val Leu Gln Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr
190 195 200 205

GTG CCC TCC AGC ACC TGG CCC AGC GAG ACC GTC ACC TGC AAC GTT GCC 673
Val Pro Ser Ser Thr Trp Pro Ser Glu Thr Val Thr Cys Asn Val Ala
210 215 220

CAC CCG GCC AGC AGC ACC AAG GTG GAC AAG AAA ATT GTG CCC AGG GAT 721
His Pro Ala Ser Ser Thr Lys Val Asp Lys Lys Ile Val Pro Arg Asp
225 230 235

TGT GGT TGT AAG CCT TGC ATA TGT ACA GTC CCA GAA GTA TCA TCT GTC 769
Cys Gly Cys Lys Pro Cys Ile Cys Thr Val Pro Glu Val Ser Ser Val
240 245 250

TTC ATC TTC CCC CCA AAG CCC AAG GAT GTG CTC ACC ATT ACT CTG ACT 817
Phe Ile Phe Pro Pro Lys Pro Lys Asp Val Leu Thr Ile Thr Leu Thr
255 260 265

CCT AAG GTC ACG TGT GTT GTG GTA GAC ATC AGC AAG GAT GAT CCC GAG 865
Pro Lys Val Thr Cys Val Val Val Asp Ile Ser Lys Asp Asp Pro Glu
270 275 280 285

GTC CAG TTC AGC TGG TTT GTA GAT GAT GTG GAG GTG CAC ACA GCT CAG 913
Val Gln Phe Ser Trp Phe Val Asp Asp Val Glu Val His Thr Ala Gln
290 295 300

ACG CAA CCC CGG GAG GAG CAG TTC AAC AGC ACT TTC CGC TCA GTC AGT 961
Thr Gln Pro Arg Glu Glu Gln Phe Asn Ser Thr Phe Arg Ser Val Ser
305 310 315

GAA CTT CCC ATC ATG CAC CAG GAC TGG CTC AAT GGC AAG GAG TTC AAA 1009
Glu Leu Pro Ile Met His Gln Asp Trp Leu Asn Gly Lys Glu Phe Lys
320 325 330

TGC AGG GTC AAC AGT GCA GCT TTC CCT GCC CCC ATC GAG AAA ACC ATC 1057
Cys Arg Val Asn Ser Ala Ala Phe Pro Ala Pro Ile Glu Lys Thr Ile
335 340 345

TCC AAA ACC AAA GGC AGA CCG AAG GCT CCA CAG GTG TAC ACC ATT CCA 1105
Ser Lys Thr Lys Gly Arg Pro Lys Ala Pro Gln Val Tyr Thr Ile Pro
350 355 360 365

CCT CCC AAG GAG CAG ATG GCC AAG GAT AAA GTC AGT CTG AAC TGC ATG 1153
Pro Pro Lys Glu Gln Met Ala Lys Asp Lys Val Ser Leu Asn Cys Met
370 375 380

ATA ACA GAC TTC TTC CCT GAA GAC ATT ACT GTG GAG TGG CAG TGG AAT 1201
Ile Thr Asp Phe Phe Pro Glu Asp Ile Thr Val Glu Trp Gln Trp Asn
385 390 395

GGG CAG CCA GCG GAG AAC TAC AAG AAC ACT CAG CCC ATC ATG GAC ACA 1249
Gly Gln Pro Ala Glu Asn Tyr Lys Asn Thr Gln Pro Ile Met Asp Thr
400 405 410

GAT GGC TCT TAC TTC GTC TAC AGC AAG CTC AAT GTG CAG AAG AGC AAC 1297
Asp Gly Ser Tyr Phe Val Tyr Ser Lys Leu Asn Val Gln Lys Ser Asn
415 420 425

TGG GAG GCA GGA AAT ACT TTC ACC TGC TCT GTG TTA CAT GAG GGC CTG 1345
Trp Glu Ala Gly Asn Thr Phe Thr Cys Ser Val Leu His Glu Gly Leu
430 435 440 445

CAC AAC CAC CAT ACT GAG AAG AGC CTC TCC CAC TCT CCT GGT AAA T 1391
His Asn His His Thr Glu Lys Ser Leu Ser His Ser Pro Gly Lys
450 455 460

GATCCCAGTG TCCTTGGAGC CCTCTGGTCC TACAGGACTC TGACACCTAC CTCCACCCCT 1451

CCCTGTATAA ATAAAGCACC CAGCACTGCC TTGGACCC 1489
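
The amino acid lines printed beneath each codon line can be cross-checked against the nucleotide sequence. The following is a minimal sketch using the standard genetic code; it is an illustration rather than the tool used to prepare the listing, and it translates only the first codon line of SEQ ID NO:1, starting at the ATG at position 11. The function name translate is illustrative.

```python
from itertools import product

# Standard genetic code, indexed with bases in the order T, C, A, G.
BASES = "TCAG"
ONE_LETTER = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TO_AA = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), ONE_LETTER)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a DNA string (spaces ignored) into three-letter residues.

    Translation stops at the first stop codon; a trailing partial
    codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TO_AA[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues


# Coding part of the first line of SEQ ID NO:1 (after GGTCCTTACA).
first_line = "ATG AAA TGC AGC TGG GTC ATC TTC TTC CTG ATG GCA GVG".replace("V", "T")
print(" ".join(translate(first_line)))
# prints: Met Lys Cys Ser Trp Val Ile Phe Phe Leu Met Ala Val
```

A check of this kind is useful for spotting transcription damage in the residue lines, for example an isoleucine or glutamine abbreviation that has been misread.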



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
     (A) LENGTH: 460 amino acids
     (B) TYPE: amino acid
     (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Lys Cys Ser Trp Val Ile Phe Phe Leu Met Ala Val Val Thr Gly
1 5 10 15

Val Asn Ser Glu Ile Gln Leu Gln Gln Ser Gly Ala Glu Leu Val Arg
20 25 30

Pro Gly Ala Leu Val Lys Leu Ser Cys Lys Ala Ser Gly Phe Asn Ile
35 40 45

Lys Asp Tyr Tyr Met His Trp Val Lys Gln Arg Pro Glu Gln Gly Leu
50 55 60

Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr Asp
65 70 75 80

Pro Lys Phe Gln Gly Lys Ala Ser Ile Thr Ala Asp Thr Ser Ser Asn
85 90 95

Thr Ala Tyr Leu Gln Leu Ser Ser Leu Thr Ser Glu Asp Thr Ala Val
100 105 110

Tyr Tyr Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly Gln
115 120 125

Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr Pro Pro Ser Val
130 135 140

Tyr Pro Leu Ala Pro Gly Ser Ala Ala Gln Thr Asn Ser Met Val Thr
145 150 155 160

Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro Val Thr Val Thr
165 170 175

Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe Pro Ala Val
180 185 190

Leu Gln Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr Val Pro Ser
195 200 205

Ser Thr Trp Pro Ser Glu Thr Val Thr Cys Asn Val Ala His Pro Ala
210 215 220

Ser Ser Thr Lys Val Asp Lys Lys Ile Val Pro Arg Asp Cys Gly Cys
225 230 235 240

Lys Pro Cys Ile Cys Thr Val Pro Glu Val Ser Ser Val Phe Ile Phe
245 250 255

Pro Pro Lys Pro Lys Asp Val Leu Thr Ile Thr Leu Thr Pro Lys Val
260 265 270

Thr Cys Val Val Val Asp Ile Ser Lys Asp Asp Pro Glu Val Gln Phe
275 280 285

Ser Trp Phe Val Asp Asp Val Glu Val His Thr Ala Gln Thr Gln Pro
290 295 300

Arg Glu Glu Gln Phe Asn Ser Thr Phe Arg Ser Val Ser Glu Leu Pro
305 310 315 320

Ile Met His Gln Asp Trp Leu Asn Gly Lys Glu Phe Lys Cys Arg Val
325 330 335

Asn Ser Ala Ala Phe Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Thr
340 345 350

Lys Gly Arg Pro Lys Ala Pro Gln Val Tyr Thr Ile Pro Pro Pro Lys
355 360 365

Glu Gln Met Ala Lys Asp Lys Val Ser Leu Asn Cys Met Ile Thr Asp
370 375 380

Phe Phe Pro Glu Asp Ile Thr Val Glu Trp Gln Trp Asn Gly Gln Pro
385 390 395 400

Ala Glu Asn Tyr Lys Asn Thr Gln Pro Ile Met Asp Thr Asp Gly Ser
405 410 415

Tyr Phe Val Tyr Ser Lys Leu Asn Val Gln Lys Ser Asn Trp Glu Ala
420 425 430

Gly Asn Thr Phe Thr Cys Ser Val Leu His Glu Gly Leu His Asn His
435 440 445

His Thr Glu Lys Ser Leu Ser His Ser Pro Gly Lys
450 455 460

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
     (A) LENGTH: 937 base pairs
     (B) TYPE: nucleic acid
     (C) STRANDEDNESS: double
     (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:
     (A) NAME/KEY: CDS
     (B) LOCATION: 5..706

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GGAC ATG CGG GCC CCT GCT CAG TTT TTT GGG ATC TTG TTG CTC TGG TTT 49 
Met Arg Ala Pro Ala Gln Phe Phe Gly Ile Leu Leu Leu Trp Phe
1 5 10 15

CCA GGT ATC AGA TGT GAC ATC AAG ATG ACC CAG TCT CCA TCC TCC ATG 97
Pro Gly Ile Arg Cys Asp Ile Lys Met Thr Gln Ser Pro Ser Ser Met
20 25 30

TAT GCA TCG CTG GGA GAG AGA GTC ACT ATC ACT TGT AAG GCG AGT CAG 145
Tyr Ala Ser Leu Gly Glu Arg Val Thr Ile Thr Cys Lys Ala Ser Gln
35 40 45

GAC ATT AGA AAG TAT TTA AAC TGG TAC CAG CAG AAA CCA TGG AAA TCT 193
Asp Ile Arg Lys Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Trp Lys Ser
50 55 60

CCT AAG ACC CTG ATC TAT TAT GCA ACA AGC TTG GCA GAT GGG GTC CCA 241
Pro Lys Thr Leu Ile Tyr Tyr Ala Thr Ser Leu Ala Asp Gly Val Pro
65 70 75

TCA AGA TTC AGT GGC AGT GGA TCT GGG CAA GAT TAT TCT CTA ACC ATC 289
Ser Arg Phe Ser Gly Ser Gly Ser Gly Gln Asp Tyr Ser Leu Thr Ile
80 85 90 95

AGC AGC CTG GAG TCT GAC GAT ACA GCA ACT TAT TAC TGT CTA CAA CAT 337
Ser Ser Leu Glu Ser Asp Asp Thr Ala Thr Tyr Tyr Cys Leu Gln His
100 105 110

GGT GAG AGC CCG TAC ACG TTC GGA GGG GGG ACC AAG CTG GAA ATA AAC 385
Gly Glu Ser Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Asn
115 120 125

AGG GCT GAT GCT GCA CCA ACT GTA TCC ATC TTC CCA CCA TCC AGT GAG 433
Arg Ala Asp Ala Ala Pro Thr Val Ser Ile Phe Pro Pro Ser Ser Glu
130 135 140

CAG TTA ACA TCT GGA GGT GCC TCA GTC GTG TGC TTC TTG AAC AAC TTC 481
Gln Leu Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn Phe
145 150 155

TAC CCC AAA GAC ATC AAT GTC AAG TGG AAG ATT GAT GGC AGT GAA CGA 529
Tyr Pro Lys Asp Ile Asn Val Lys Trp Lys Ile Asp Gly Ser Glu Arg
160 165 170 175

CAA AAT GGC GTC CTG AAC AGT TGG ACT GAT CAG GAC AGC AAA GAC AGC 577
Gln Asn Gly Val Leu Asn Ser Trp Thr Asp Gln Asp Ser Lys Asp Ser
180 185 190

ACC TAC AGC ATG AGC AGC ACC CTC ACG TTG ACC AAG GAC GAG TAT GAA 625
Thr Tyr Ser Met Ser Ser Thr Leu Thr Leu Thr Lys Asp Glu Tyr Glu
195 200 205

CGA CAT AAC AGC TAT ACC TGT GAG GCC ACT CAC AAG ACA TCA ACT TCA 673
Arg His Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser
210 215 220

CCC AAT GTC AAG AGC TTC AAC AAG AAT GAG TGT TAGAGACAAA GGTCCTGAGA 726
Pro Asn Val Lys Ser Phe Asn Lys Asn Glu Cys
225 230

CGCCACCACC AGCTCCCCAG CTCCATCCTA TCTTCCCTTC TAAGGTCTTG GAGGCTTCCC 786 
CACAAGCGAC CTACCACTGT TGCGGTGCTC CAAACCTCCT CCCCACCTCC TTCTCCTCCT 846 
CCTCCCTTTC CTTGGCTTTT ATCATGCTAA TATTTGCAGA AAATATTCAA TAAAGTGAGT 906 
CTTTGCACTT GAAAAAAAAA AAAAAAAAAA A 937 


(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Arg Ala Pro Ala Gln Phe Phe Gly Ile Leu Leu Leu Trp Phe Pro
1 5 10 15

Gly Ile Arg Cys Asp Ile Lys Met Thr Gln Ser Pro Ser Ser Met Tyr
20 25 30

Ala Ser Leu Gly Glu Arg Val Thr Ile Thr Cys Lys Ala Ser Gln Asp
35 40 45

Ile Arg Lys Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Trp Lys Ser Pro
50 55 60

Lys Thr Leu Ile Tyr Tyr Ala Thr Ser Leu Ala Asp Gly Val Pro Ser
65 70 75 80

Arg Phe Ser Gly Ser Gly Ser Gly Gln Asp Tyr Ser Leu Thr Ile Ser
85 90 95

Ser Leu Glu Ser Asp Asp Thr Ala Thr Tyr Tyr Cys Leu Gln His Gly
100 105 110

Glu Ser Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Asn Arg
115 120 125

Ala Asp Ala Ala Pro Thr Val Ser Ile Phe Pro Pro Ser Ser Glu Gln
130 135 140

Leu Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn Phe Tyr
145 150 155 160

Pro Lys Asp Ile Asn Val Lys Trp Lys Ile Asp Gly Ser Glu Arg Gln
165 170 175

Asn Gly Val Leu Asn Ser Trp Thr Asp Gln Asp Ser Lys Asp Ser Thr
180 185 190

Tyr Ser Met Ser Ser Thr Leu Thr Leu Thr Lys Asp Glu Tyr Glu Arg
195 200 205

His Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser Pro
210 215 220

Asn Val Lys Ser Phe Asn Lys Asn Glu Cys
225 230

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Asp Tyr Met His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr Lys Pro Lys Phe Gln
1 5 10 15

Gly

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Asp Asn Ser Tyr Tyr Phe Asp Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Lys Ala Ser Gln Asp Ile Arg Lys Tyr Leu Asn
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Tyr Ala Thr Ser Leu Ala Asp
1 5

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Leu Gln His Gly Glu Ser Pro Tyr Thr
1 5 



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Leu Leu Arg Leu Ser Cys Lys Ala Ser Gly Phe Asn Ile Lys Asp Tyr
20 25 30

Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr Asp Pro Lys Phe
50 55 60

Gln Gly Arg Phe Ser Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95

Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly Gln Gly Thr Pro
100 105 110

Val Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Gln Asp Ile Arg Lys Tyr
20 25 30

Leu Asn Trp Tyr Gln Gln Lys Pro Trp Lys Ala Pro Lys Thr Leu Ile
35 40 45

Tyr Tyr Ala Thr Ser Leu Ala Asp Gly Val Pro Ser Arg Phe Ser Gly
50 55 60

Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80

Glu Asp Ile Ala Thr Tyr Tyr Cys Leu Gln His Gly Glu Ser Pro Tyr
85 90 95

Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Thr Arg
100 105



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Phe Asn Ile Lys Asp Tyr
20 25 30

Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr Asp Pro Lys Phe
50 55 60

Gln Gly Arg Phe Thr Ile Ser Ala Asp Asn Ser Lys Asn Thr Leu Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95

Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly Gln Gly Thr Pro
100 105 110

Val Thr Val Ser Ser
115



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Gln Asp Ile Arg Lys Tyr
20 25 30

Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45

Tyr Tyr Ala Thr Ser Leu Ala Asp Gly Val Pro Ser Arg Phe Ser Gly
50 55 60

Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80

Glu Asp Ile Ala Thr Tyr Tyr Cys Leu Gln His Gly Glu Ser Pro Tyr
85 90 95

Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Thr Arg
100 105



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7073 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 61..717

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1111..1146

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1268..1594

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1692..2012

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GAATTCGCCT CCACCATGGA ATGGAGCTGG GTCTTTCTCT TCTTCTTGTC AGTAACTACA 60 

GGT GTA CAC TCA CAA GTT CAG CTG GTG GAG TCT GGA GGA GGA GTA GTA 108 
Gly Val His Ser Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val
1 5 10 15

CAA CCT GGA AGG TCA CTG AGA CTG TCT TGT AAG GCT AGT GGA TTC AAT 156
Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Phe Asn
20 25 30

ATC AAG GAC TAT TAT ATG CAC TGG GTC AGA CAA GCT CCT GGA AAA GGA 204
Ile Lys Asp Tyr Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly
35 40 45

CTC GAG TGG ATA GGT TTA ATT GAT CCT GAG AAT GGT AAC ACG ATA TAT 252
Leu Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr
50 55 60

GAT CCC AAG TTC CAA GGA AGA TTC ATA ATT TCT GCA GAC AAC TCT AAG 300
Asp Pro Lys Phe Gln Gly Arg Phe Ile Ile Ser Ala Asp Asn Ser Lys
65 70 75 80

AAT ACA CTG TTC CTG CAG ATG GAC TCA CTC AGA CCT GAG GAT ACA GCA 348
Asn Thr Leu Phe Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala
85 90 95

GTC TAC TTT TGT GCT AGA GAT AAC AGT TAT TAC TTC GAC TAC TGG GGC 396
Val Tyr Phe Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly
100 105 110

CAA GGA ACA CCA GTC ACC GTG AGC TCA GCT TCC ACC AAG GGC CCA TCC 444
Gln Gly Thr Pro Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
115 120 125

GTC TTC CCC CTG GCG CCC TGC TCC AGG AGC ACC TCC GAG AGC ACA GCC 492
Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Glu Ser Thr Ala
130 135 140
GCC CTG GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG 540
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
145 150 155 160

TCG TGG AAC TCA GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT 588
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
165 170 175

GTC CTA CAG TCC TCA GGA CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG 636
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
180 185 190 

CCC TCC AGC AGC TTG GGC ACG AAG ACC TAC ACC TGC AAC GTA GAT CAC 684 
Pro Ser Ser Ser Leu Gly Thr Lys Thr Tyr Thr Cys Asn Val Asp His 
195 200 205 

AAG CCC AGC AAC ACC AAG GTG GAC AAG AGA GTT GGTGAGAGGC CAGCACAGGG 737 
Lys Pro Ser Asn Thr Lys Val Asp Lys Arg Val 
210 215 

CAGGGAGGGT GTCTGCTGGA AGCCAGGCTC AGCCCTCCTG CCTGGACGCA CCCCGGCTGT 797 

GCAGCCCCAG CCCAGGGCAG CAAGGCATGC CCCATCTGTC TCCTCACCCG GAGGCCTCTG 857 

ACCACCCCAC TCATGCTCAG GGAGAGGGTC TTCTGGATTT TTCCACCAGG CTCCGGGCAG 917 

CCACAGGCTG GATGCCCCTA CCCCAGGCCC TGCGCATACA GGGGCAGGTG CTGCGCTCAG 977 

ACCTGCCAAG AGCCATATCC GGGAGGACCC TGCCCCTGAC CTAAGCCCAC CCCAAAGGCC 1037 

AAACTCTCCA CTCCCTCAGC TCAGACACCT TCTCTCCTCC CAGATTCGAG TAACTCCCAA 1097 

TCTTCTCTCT GCA GAG TCC AAA TAT GGT CCC CCA TGC CCA TCA TGC CCA 1146 
Glu Ser Lys Tyr Gly Pro Pro Cys Pro Ser Cys Pro 
1 5 10

GGTAAGCCAA CCCAGGCCTC GCCCTCCAGC TCAAGGCGGG ACAGGTGCCC TAGAGTAGCC 1206 

TGCATCCAGG GACAGGCCCC AGCCGGGTGC TGACGCATCC ACCTCCATCT CTTCCTCAGC 1266 

A CCT GAG TTC CTG GGG GGA CCA TCA GTC TTC CTG TTC CCC CCA AAA 1312 
Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys 
1 5 10 15

CCC AAG GAC ACT CTC ATG ATC TCC CGG ACC CCT GAG GTC ACG TGC GTG 1360
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30

GTG GTG GAC GTG AGC CAG GAA GAC CCC GAG GTC CAG TTC AAC TGG TAC 1408
Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr
35 40 45

GTG GAT GGC GTG GAG GTG CAT AAT GCC AAG ACA AAG CCG CGG GAG GAG 1456
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60

CAG TTC AAC AGC ACG TAC CGT GTG GTC AGC GTC CTC ACC GTC ATG CAC 1504
Gln Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Met His
65 70 75

CAG GAC TGG CTG AAC GGC AAG GAG TAC AAG TGC AAG GTC TCC AAC AAA 1552
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
80 85 90 95

GGC CTC CCG TCC TCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 1594
Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys
100 105

GGTGGGACCC ACGGGGTGCG AGGGCCACAT GGACAGAGGT CAGCTCGGCC CACCCTCTGC 1654

CCTGGGAGTG ACCGCTGTGC CAACCTCTGT CCCTACA GGG CAG CCC CGA GAG CCA 1709
Gly Gln Pro Arg Glu Pro
1 5

CAG GTG TAC ACC CTG CCC CCA TCC CAG GAG GAG ATG ACC AAG AAC CAG 1757
Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln
10 15 20

GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAC CCC AGC GAC ATC GCC 1805
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
25 30 35

GTG GAG TGG GAG AGT AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 1853
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
40 45 50

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AGG CTA 1901
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg Leu
55 60 65 70

ACC GTG GAC AAG AGC AGG TGG CAG GAG GGG AAT GTC TTC TCA GTC TCC 1949
Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Val Ser
75 80 85

GTG ATG CAT GAG GCT CTG CAC AAC CAC TAC ACA CAG AAG AGC CTC TCC 1997
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
90 95 100

CTG TCT CTG GGT AAA TGAGTGCCAG GGCCGGCAAG CCCCCGCTCC CCGGGCTCTC 2052
Leu Ser Leu Gly Lys
105

GGGGTCGCGC GAGGATGCTT GGCACGTACC CCGTCTACAT ACTTCCCAGG CACCCAGCAT 2112 

GGAAATAAAG CACCCACCAC TGCCCTGGGC CCCTGTGAGA CTGTGATGGT TCTTTCCACG 2172 

GGTCAGGCCG AGTCTGAGGC CTGAGTGACA TGAGGGAGGC AGAGCGGGTC CCACTGTCCC 2232 

CACACTGGCC CAGGCTGTGC AGGTGTGCCT GGGCCACCTA GGGTGGGGCT CAGCCAGGGG 2292 

CTGCCCTCGG CAGGGTGGGG GATTTGCCAG CGTGGCCCTC CCTCCAGCAG CAGGACTCTA 2352 

GAGGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA CTTGCTTTAA AAAACCTCCC 2412 

ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT 2472 

TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT 2532 

TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG 2592 

GATCCTCTAC GCCGGACGCA TCGTGGCCGG CATCACCGGC GCCACAGGTG CGGTTGCTGG 2652 

CGCCTATATC GCCGACATCA CCGATGGGGA AGATCGGGCT CGCCACTTCG GGCTCATGAG 2712 

CGCTTGTTTC GGCGTGGGTA TGGTGGCAGG CCCGTGGCCG GGGGACTGTT GGGCGCCATC 2772 

TCCTTGCATG CACCATTCCT TGCGGCGGCG GTGCTCAACG GCCTCAACCT ACTACTGGGC 2832 

TGCTTCCTAA TGCAGGAGTC GCATAAGGGA GAGCGTCGAC CTCGGGCCGC GTTGCTGGCG 2892 

TTTTTCCATA GGCTCCGCCC CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG 2952 

TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 3012 

CGCTCTCCTG TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 3072 

AGCGTGGCGC TTTCTCAATG CTCACGCTGT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC 3132 

TCCAAGCTGG GCTGTGTGCA CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 3192 

AACTATCGTC TTGAGTCCAA CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT 3252 
GGTAACAGGA TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 3312

CCTAACTACG GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT 3372

ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT 3432

GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA AAGGATCTCA AGAAGATCCT 3492

TTGATCTTTT CTACGGGGTC TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG 3552

GTCATGAGAT TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT 3612

AAATCAATCT AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT 3672

GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC 3732

GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG 3792

CGAGACCCAC GCTCACCGGC TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC 3852

GAGCGCAGAA GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG 3912

GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA 3972

GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCATCAT TCAGCTCCGG TTCCCAACGA 4032

TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT 4092

CCGATCGTTG TCAGAAGTAA GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG 4152

CATAATTCTC TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA 4212

ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAAGA 4272

CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG AAAACGTTCT 4332

TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT CCAGTTCGAT GTAACCCACT 4392

CGTGCACCCA ACTGATCTTC AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA 4452

ACAGGAAGGC AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 4512

ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA 4572

TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA 4632

AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA CATTAACCTA TAAAAATAGG 4692

CGTATCACGA GGCCCTGATG GCTCTTTGCG [illegible] [illegible] [illegible] 4752

CGACGACAAC CCTCAAGAGA AAATGTAATC [illegible] [illegible] [illegible] 4812

GCGTTTATAA GGAGACACTT TATGTTTAAG AAGGTTGGTA [illegible] [illegible] 4872

GCCAAGCTAG AGATCTCTAG CTTCGTGTCA AGGACGGTGA CTGCAGTGAA [illegible] 4932

TGTGTTTGTC CGAAATACGC GTTTTGAGAT TTCTGTCGCC GACTAAATTC [illegible] 4992

ATAGTGGTGT TTATCGCCGA TAGAGATGGC GATATTGGAA [illegible] [illegible] 5052

GGCATATTGA AAATGTCGCC GATGTGAGTT TCTGTGTAAC [illegible] [illegible] 5112

AAGTGATTTT TGGGCATACG CGATATCTGG [illegible] [illegible] [illegible] 5172

GCGATAGACG ACTTTGGTGA CTTGGGCGAT [illegible] [illegible] [illegible] 5232

ATAGGTGACA GACGATATGA [illegible] [illegible] [illegible] [illegible] 5292

GCCAATGCAT ATCGATCTAT ACATTGAATC [illegible] [illegible] TTATTCATTG 5352

GTTATATAGC ATAAATCAAT [illegible] [illegible] [illegible] CCATATCATA 5412

ATATGTACAT TTATATTGGC TCATGTCCAA [illegible] [illegible] [illegible] 5472

CTAGTTATTA ATAGTAATCA [illegible] [illegible] [illegible] ATGGAGTTCC 5532

GCGTTACATA [illegible] [illegible] [illegible] [illegible] CCCCGCCCAT 5592

TGACGTCAAT AATGACGTAT [illegible] [illegible] AGGGACTTTC CATTGACGTC 5652

AATGGGTGGA [illegible] [illegible] ACTTGGCAGT ACATCAAGTG TATCATATGC 5712

CAAGTACGCC [illegible] [illegible] GTAAATGGCC CGCCTGGCAT TATGCCCAGT 5772

ACATGACCTT ATGGGACTTT CCTACTTGGC AGTACATCTA CGTATTAGTC ATCGCTATTA 5832

CCATGGTGAT GCGGTTTTGG CAGTACATCA ATGGGCGTGG ATAGCGGTTT GACTCACGGG 5892

GATTTCCAAG TCTCCACCCC ATTGACGTCA ATGGGAGTTT GTTTTGGCAC CAAAATCAAC 5952

GGGACTTTCC AAAATGTCGT AACAACTCCG CCCCATTGAC GCAAATGGGC GGTAGGCGTG 6012

TACGGTGGGA GGTCTATATA AGCAGAGCTC GTTTAGTGAA CCGTCAGATC GCCTGGAGAC 6072

GCCATCCACG CTGTTTTGAC CTCCATAGAA GACACCGGGA CCGATCCAGC CTCCGCGGCC 6132

GGGAACGGTG CATTGGAACG CGGATTCCCC GTGCCAAGAG TGACGTAAGT ACCGCCTATA 6192

GAGTCTATAG GCCCACCCCC TTGGCTTCTT ATGCATGCTA TACTGTTTTT GGCTTGGGGT 6252

CTATACACCC CCGCTTCCTC ATGTTATAGG TGATGGTATA GCTTAGCCTA TAGGTGTGGG 6312

TTATTGACCA TTATTGACCA CTCCCCTATT GGTGACGATA CTTTCCATTA CTAATCCATA 6372

ACATGGCTCT TTGCCACAAC TCTCTTTATT GGCTATATGC CAATACACTG TCCTTCAGAG 6432

ACTGACACGG ACTCTGTATT TTTACAGGAT GGGGTCTCAT TTATTATTTA CAAATTCACA 6492

TATACAACAC CACCGTCCCC AGTGCCCGCA GTTTTTATTA AACATAACGT GGGATCTCCA 6552

CGCGAATCTC GGGTACGTGT TCCGGACATG GGCTCTTCTC CGGTAGCGGC GGAGCTTCTA 6612

CATCCGAGCC CTGCTCCCAT CCCTCCAGCG ACTCATGGTC GCTCGGCAGC TCCTTGCTCC 6672

TAACAGTGGA GGCCAGACTT AGGCACAGCA CGATGCCCAC CACCACCAGT GTGCCGCACA 6732

[illegible] [illegible] [illegible] [illegible] GGAGCGGGCT TGCACCGCTG 6792

ACGCATTTGG AAGACTTAAG GCAGCGGCAG AAGAAGATGC AGGCAGCTGA GTTGTTGTGT 6852

TCTGATAAGA GTCAGAGGTA ACTCCCGTTG CGGTGCTGTT AACGGTGGAG GGCAGTGTAG 6912

TCTGAGCAGT ACTCGTTGCT GCCGCGCGCG CCACCAGACA TAATAGCTGA CAGACTAACA 6972

GACTGTTCCT TTCCATGGGT CTTTTCTGCA GTCACCGTCC TTGACACGAA GCTTGGGCTG 7032

CAGGTCGATC GACTCTAGAG GATCGATCCC CGGGCGAGCT C 7073



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 219 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Gly Val His Ser Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val
1 5 10 15

Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Phe Asn
20 25 30

Ile Lys Asp Tyr Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly
35 40 45

Leu Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr
50 55 60

Asp Pro Lys Phe Gln Gly Arg Phe Ile Ile Ser Ala Asp Asn Ser Lys
65 70 75 80

Asn Thr Leu Phe Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala
85 90 95

Val Tyr Phe Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly
100 105 110

Gln Gly Thr Pro Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
115 120 125

Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Glu Ser Thr Ala
130 135 140

Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
145 150 155 160

Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
165 170 175

Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
180 185 190

Pro Ser Ser Ser Leu Gly Thr Lys Thr Tyr Thr Cys Asn Val Asp His
195 200 205

Lys Pro Ser Asn Thr Lys Val Asp Lys Arg Val
210 215

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Ser Lys Tyr Gly Pro Pro Cys Pro Ser Cys Pro 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 18: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
1 5 10 15

Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val
20 25 30

Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val
35 40 45

Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
50 55 60

Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Met His Gln
65 70 75 80

Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly
85 90 95

Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys
100 105

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu
1 5 10 15

Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe
20 25 30

Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu
35 40 45

Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe
50 55 60

Phe Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly
65 70 75 80

Asn Val Phe Ser Val Ser Val Met His Glu Ala Leu His Asn His Tyr
85 90 95

Thr Gln Lys Ser Leu Ser Leu Ser Leu Gly Lys
100 105



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7864 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 9..711



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



AATTCACCAT GGGTGTGCCA ACTCAGGTAT TAGGATTACT GCTGCTGTGG CTTACAGATG 60

CAAGATGTGA TATCCAAATG ACACAATCTC CTTCTTCTCT AAGTGCTTCT GTCGGAGATA 120

GAGTAACAAT TACATGTAAG GCGAGTCAGG ACATTAGAAA GTATTTAAAC TGGTATCAGC 180

AAAAACCTGG GAAGGCTCCT AAGCTACTGA TTTATTATGC AACAAGTTTG GCAGATGGAG 240

TACCTTCTAG ATTTTCTGGT TCTGGCTCTG GAACAGACTA CACATTCACA ATTTCTTCTC 300

TCCAACCTGA GGACATTGCT ACATACTACT GCCTACAACA TGGTGAGAGT CCGTATACAT 360

TTGGACAAGG AACAAAACTA GAGATCACAA GAACTGTTGC GGCGCCGTCT GTCTTCATCT 420

TCCCGCCATC TGATGAGCAG TTGAAATCTG GAACTGCCTC TGTTGTGTGC CTGCTGAATA 480

ACTTCTATCC CAGAGAGGCC AAAGTACAGT GGAAGGTGGA TAACGCCCTC CAATCGGGTA 540

ACTCCCAGGA GAGTGTCACA GAGCAGGACA GCAAGGACAG CACCTACAGC CTCAGCAGCA 600

CCCTGACGCT GAGCAAAGCA GACTACGAGA AACACAAAGT CTACGCCTGC GAAGTCACCC 660

ATCAGGGCCT GAGCTCGCCC GTCACAAAGA GCTTCAACAG GGGAGAGTGT TAGAGGGAGA 720

AGTGCCCCCA CCTGCTCCTC AGTTCCAGCC TGGGGATCAT AATCAGCCAT ACCACATTTG 780

TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA 840

TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA 900

ATAGCATCAC AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT 960

CCAAACTCAT CAATGTATCT TATCATGTCT GGATCCTCTA CGCCGGACGC ATCGTGGCCG 1020

GCATCACCGG CGCCACAGGT GCGGTTGCTG GCGCCTATAT CGCCGACATC ACCGATGGGG 1080

AAGATCGGGC TCGCCACTTC GGGCTCATGA GCGCTTGTTT CGGCGTGGGT ATGGTGGCAG 1140

GCCCGTGGCC GGGGGACTGT TGGGCGCCAT CTCCTTGCAT GCACCATTCC TTGCGGCGGC 1200

GGTGCTCAAC GGCCTCAACC TACTACTGGG CTGCTTCCTA ATGCAGGAGT CGCATAAGGG 1260

AGAGCGTCGA CCTCGGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA 1320

GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA 1380

CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC 1440

CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 1500

TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC 1560

CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG 1620

ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT 1680

AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT 1740

ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 1800

ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 1860

GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA 1920

GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC 1980

CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC 2040

TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA [illegible] 2100

TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC [illegible] 2160

ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG [illegible] 2220

[illegible] [illegible] CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC 2280

CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA 2340

TAGTTTGCGC AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG 2400

TATGGCTTCA TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT 2460

GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC 2520

AGTGTTATCA CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT 2580

AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG 2640

GCGACCGAGT TGCTCTTGCC CGGCGTCAAC ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 2700

TTTAAAAGTG CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC 2760

GCTGTTGAGA TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT 2820

TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG 2880

AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG 2940

CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA 3000

ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT 3060

TATTATCATG ACATTAACCT ATAAAAATAG GCGTATCACG AGGCCCTGAT GGCTCTTTGC 3120

GGCACCCATC GTTCGTAATG TTCCGTGGCA CCGAGGACAA CCCTCAAGAG AAAATGTAAT 3180

CACACTGGCT CACCTTCGGG TGGGCCTTTC TGCGTTTATA AGGAGACACT TTATGTTTAA 3240

GAAGGTTGGT AAATTCCTTG CGGCTTTGGC AGCCAAGCTA GAGATCCGGC TGTGGAATGT 3300

GTGTCAGTTA GGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA TGCAAAGCAT 3360

GCATCTCAAT TAGTCAGCAA CCAGGCTCCC CAGCAGGCAG AAGTATGCAA AGCATGCATC 3420

TCAATTAGTC AGCAACCATA GTCCCGCCCC TAACTCCGCC CATCCCGCCC CTAACTCCGC 3480

CCAGTTCCGC CCATTCTCCG CCCCATGGCT GACTAATTTT TTTTATTTAT GCAGAGGCCG 3540

AGGCCGCCTC GGCCTCTGAG CTATTCCAGA AGTAGTGAGG AGGCTTTTTT GGAGGCCTAG 3600

GCTTTTGCAA AAAGCTAGCT TGGGGCCACC GCTCAGAGCA CCTTCCACCA TGGCCACCTC 3660

AGCAAGTTCC CACTTGAACA AAAACATCAA GCAAATGTAC [illegible] [illegible] 3720

GAAAGTCCAA GCCATGTATA TCTGGGTTGA TGGTACTGGA GAAGGACTGC GCTGCAAAAC 3780

CCGCACCCTG GACTGTGAGC CCAAGTGTGT AGAAGAGTTA CCTGAGTGGA ATTTTGATGG 3840

CTCTAGTACC TTTCAGTCTG AGGGCTCCAA CAGTGACATG TATCTCAGCC CTGTTGCCAT 3900

GTTTCGGGAC CCCTTCCGCA GAGATCCCAA CAAGCTGGTG TTCTGTGAAG TTTTCAAGTA 3960

CAACCGGAAG CCTGCAGAGA CCAATTTAAG GCACTCGTGT AAACGGATAA TGGACATGGT 4020
GAGCAACCAG CACCCCTGGT TTGGAATGGA ACAGGAGTAT ACTCTGATGG GAACAGATGG 4080
GCACCCTTTT GGTTGGCCTT CCAATGGCTT TCCTGGGCCC CAAGGTCCGT ATTACTGTGG 4140
TGTGGGCGCA GACAAAGCCT ATGGCAGGGA TATCGTGGAG GCTCACTACC GCGCCTGCTT 4200
GTATGCTGGG GTCAAGATTA CAGGAACAAA TGCTGAGGTC ATGCCTGCCC AGTGGGAACT 4260
CCAAATAGGA CCCTGTGAAG GAATCCGCAT GGGAGATCAT CTCTGGGTGG CCCGTTTCAT 4320
CTTNCATCGA GTATGTGAAG ACTTTGGGGT AATAGCAACC TTTGACCCCA AGCCCATTCC 4380
TGGGAACTGG AATGGTGCAG GCTGCCATAC CAACTTTAGC ACCAAGGCCA TGCGGGAGGA 4440
GAATGGTCTG AAGCACATCG AGGAGGCCAT CGAGAAACTA AGCAAGCGGC ACCGGTACCA 4500
CATTCGAGCC TACGATCCCA AGGGGGGCCT GGACAATGCC CGTGGTCTGA CTGGGTTCCA 4560
CGAAACGTCC AACATCAACG ACTTTTCTGC TGGTGTCGCC AATCGCAGTG CCAGCATCCG 4620
CATTCCCCCG ACTGTCGGCC AGGAGAAGAA AGGTTACTTT GAAGACCGCG GCCCCTCTGC 4680
CAATTGTGAC CCCTTTGCAG TGACAGAAGC CATCGTCCGC ACATGCCTTC TCAATGAGAC 4740
TGGCCACGAG CCCTTCCAAT ACAAAAACTA ATTAGACTTT GAGTGATCTT GAGCCTTTCC 4800
TAGTTCATCC CACCCCGCCC CAGAGAGATC TTTGTGAAGG AACCTTACTT CTGTGGTGTG 4860
ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA AGGTAAATAT AAAATTTTTA 4920
AGTGTATAAT GTGTTAAACT ACTGATTCTA ATTGTTTGTG TATTTTAGAT TCCAACCTAT 4980
GGAACTGATG AATGGGAGCA GTGGTGGAAT GCCTTTAATG AGGAAAACCT GTTTTGCTCA 5040
GAAGAAATGC CATCTAGTGA TGATGAGGCT ACTGCTGACT CTCAACATTC TACTCCTCCA 5100
AAAAAGAAGA                       GACTTTCCTT CAGAATTGCT AAGTTTTTTG 5160
AGTCATGCTG TGTTTAGTAA TAGAACTCTT GCTTGCTTTG CTATTTACAC CACAAAGGAA 5220
AAAGCTGCAC TGCTATACAA GAAAATTATG GAAAAATATT CTGTAACCTT TATAAGTAGG 5280
CATAACAGTT ATAATCATAA CATACTGTTT TTTCTTACTC CACACAGGCA TAGAGTGTCT 5340
GCTATTAATA ACTATGCTCA AAAATTGTGT ACCTTTAGCT TTTTAATTTG TAAAGGGGTT 5400
AATAAGGAAT ATTTGATGTA TAGTGCCTAG ACTAGAGATC ATAATCAGCC ATACCACATT 5460



WO 96/40921    PCT/US96/09287



TGTAGAGGTT TTACTTCCTT TAAAAAACCT
AATGAATGCA ATTGTTGTTG TTAACTTGTT
CAATAGCATC ACAAATTTCA CAAATAAAGC
GTCCAAACTC ATCAATGTAT CTTATCATGT
GACTGCAGTG AATAATAAAA TGTGTGTTTG
CCTACTAAAT TCATGTCGCG CGATAGTGGT
AAAAATCGAT ATTTGAAAAT ATGGCATATT
ACTGATATCG CCATTTTTCC AAAAGTGATT
CTTATATCGT TTACGGGGGA TGGCGATAGA
TCGCAAATAT CGCAGTTTCG ATATAGGTGA
GGCGACATCA AGCTGGCACA TGGCCAATGC
CCATTAGCCA TATTATTCAT TGGTTATATA
CATACGTTGT ATCCATATCA TAATATGTAC
CCATGTTGAC ATTGATTATT GACTAGTTAT
CATAGCCCAT ATATGGAGTT CCGCGTTACA
CCGCCCAACG ACCCCCGCCC ATTGACGTCA
ATAGGGACTT TCCATTGACG TCAATGGGTG
GTACATCAAG TGTATCATAT GCCAAGTACG
CCCGCCTGGC ATTATGCCCA GTACATGACC
TACGTATTAG TCATCGCTAT TACCATGGTG
GGATAGCGGT TTGACTCACG GGGATTTCCA
TTGTTTTGGC ACCAAAATCA ACGGGACTTT
ACGCAAATGG GCGGTAGGCG TGTACGGTGG
AACCGTCAGA TCGCCTGGAG ACGCCATCCA








CCCCTGAACC 


TGAAACATAA 


5520 


XrlX luLnuul 




JV A TV 1\ T 1 TV TV TV 

ACAAATAAAG 


5580 






ui lbitjbl X X 




O'TCICZ 1TPTPT 
*_» x won iui^i 


1PPTTPPTPT 

/iLrl— X X \A» lb X 




5700 


Tppp tv a a t a r* 

X firlrl X /it— 


p PPTTTTP A P 
V»LA* X X X X uAl> 


ATTTCTGTCG 


C *7£P4 

D/OU 




p athp hpstp 


PPPHn*TA<TW|*Pn 

GCGATATTGG 


conn 

5820 


GAAAATGTCG CCGATGTGAG TTTCTGTGTA 5880
TTTGGGCATA CGCGATATCT GGCGATAGCG 5940
CGACTTTGGT GACTTGGGCG ATTCTGTGTG 6000
CAGACGATAT GAGGCTATAT CGCCGATAGA 6060
ATATCGATCT ATACATTGAA TCAATATTGG 6120
GCATAAATCA ATATTGGCTA TTGGCCATTG 6180
ATTTATATTG GCTCATGTCC AACATTACCG 6240
TAATAGTAAT CAATTACGGG GTCATTAGTT 6300
TAACTTACGG TAAATGGCCC GCCTGGCTGA 6360
ATAATGACGT ATGTTCCCAT AGTAACGCCA 6420
GAGTATTTAC GGTAAACTGC CCACTTGGCA 6480
CCCCCTATTG ACGTCAATGA CGGTAAATGG 6540
TTATGGGACT TTCCTACTTG GCAGTACATC 6600
ATGCGGTTTT GGCAGTACAT CAATGGGCGT 6660
AGTCTCCACC CCATTGACGT CAATGGGAGT 6720
CCAAAATGTC GTAACAACTC CGCCCCATTG 6780
GAGGTCTATA TAAGCAGAGC TCGTTTAGTG 6840
CGCTGTTTTG ACCTCCATAG AAGACACCGG 6900
GACCGATCCA


GCCTCCGCGG 


CCGGGAACGG 


TGCATTGGAA 


CGCGGATTCC 
wvjvvuni xv_»v_ 


x u^Wnnu 


07DU 


1 


AGTGACGTAA 


GTACCGCCTA 


TAGAGTCTAT 


AGGCCCACCC 


CCTTGGCTTC 

wwX IvVlvl X W 


x lnXuUnluW 


/ UzU 




TATACTGTTT 


TTGGCTTCGG 


GTCTATACAC 


CCCCGCTTCC 


TCATGTTATA 


GGTCATRRTA 
"w x un x vjv x n 


7nftn 

/ \JO\J 




TAGCTTAGCC 


TATAGGTGTG 


GGTTATTGAC 


CATTATTGAC 


CACTCCCCTA 


TTGGTGACGA 


71 AO 


TACTTTCCAT TACTAATCCA TAACATGGCT CTTTGCCACA ACTCTCTTTA TTGGCTATAT 7200
GCCAATACAC TGTCCTTCAG AGACTGACAC GGACTCTGTA TTTTTACAGG ATGGGGTCTC 7260
ATTTATTATT TACAAATTCA CATATACAAC ACCACCGTCC CCAGTGCCCG CAGTTTTTAT 7320




TAAACATAAC 


GTGGGATCTC 


CACGCGAATC 


TCGGGTACGT 


GTTCCGGACA 


TGCGCTCTTr* 

A. UUVV X X X Vw 






TCCGGTAGCG 


G CGGAGCTTC 


TACATCCGAG 


CCCTGCTCCC 


ATGCCTCCAG 


CGACTCATGG 


7AAO 


10 


TCGCTCGGCA 


TCTCCTTGCT 


CCTAACAGTG 


GAGGCCAGAC 


TTAGGCAPAG 


UUA> fi 1 w Lvv 


7CAft 

/ DUU 




ACCACCACCA 


GTGTGCCGCA 


CAAGGCCGTG 


GCGGTAGGGT 




a & A tp ik r* r^pr* 

nnAlbAbUiw 


/ DOU 




GGGGAGCGGG CTTGCACCGC TGACGCATTT GGAAGACTTA AGGCAGCGGC AGAAGAAGAT 7620
GCAGGCAGCT GAGTTGTTGT GTTCTGATAA GAGTCAGAGG TAACTCCCGT TGCGGTGCTG 7680
TTAACGGTGG AGGGCAGTGT AGTCTGAGCA GTACTCGTTG CTGCCGCGCG CGCCACCAGA 7740
CATAATAGCT GACAGACTAA CAGACTGTTC CTTTCCATGG GTCTTTTCTG CAGTCACCGT 7800
CCTTGACACG AAGCTTGGGC TGCAGGTCGA TCGACTCTAG AGGATCGATC CCCGGGCGAG 7860
CTCG 7864
WHAT IS CLAIMED IS:

1. A CDR-grafted antibody capable of
inhibiting human tissue factor wherein the
complementarity determining regions (CDRs) are derived
from a non-human monoclonal antibody against tissue
factor and the framework (FR) and constant (C) regions
are derived from one or more human antibodies.

2. The CDR-grafted antibody of Claim 1
wherein said non-human monoclonal antibody is a murine
antibody.

3. The CDR-grafted antibody of Claim 2
wherein said murine antibody is TF8-5G9.

4. The CDR-grafted antibody of Claim 1
wherein said CDRs of the heavy chain have the amino acid
sequences:

CDR1 DDYMH (SEQ ID NO: 5)
CDR2 LIDPENGNTIYDPKFQG (SEQ ID NO: 6)
CDR3 DNSYYFDY (SEQ ID NO: 7)

and said CDRs of the light chain have the amino acid
sequences:

CDR1 KASQDIRKYLN (SEQ ID NO: 8)
CDR2 YATSLAD (SEQ ID NO: 9)
CDR3 LQHGESPYT (SEQ ID NO: 10).

5. The CDR-grafted antibody of Claim 1
wherein the FR of the heavy chain is derived from the
human antibody KOL.

6. The CDR-grafted antibody of Claim 1
wherein the FR of the light chain is derived from the
human antibody REI.

7. The CDR-grafted antibody of Claim 1
wherein the heavy chain variable region has the amino
acid sequence of SEQ ID NO: 11.

8. The CDR-grafted antibody of Claim 1 or 7
wherein the light chain variable region has the amino
acid sequence of SEQ ID NO: 12.

9. The CDR-grafted antibody of Claim 1
wherein the heavy chain variable region has the amino
acid sequence of SEQ ID NO: 13.

10. The CDR-grafted antibody of Claim 1 or 9
wherein the light chain variable region has the amino
acid sequence of SEQ ID NO: 14.

11. The CDR-grafted antibody of Claim 1
wherein the heavy chain constant region is the human
IgG4 constant region.

12. The CDR-grafted antibody of Claim 10
wherein the heavy chain constant region is the human
IgG4 constant region.

13. The CDR-grafted antibody of Claim 1
wherein the light chain constant region is the human
kappa constant region.

14. The CDR-grafted antibody of Claim 10
wherein the light chain constant region is the human
kappa constant region.

15. CDR-grafted monoclonal antibody TF8HCDR1
x TF8LCDR1.

16. CDR-grafted monoclonal antibody TF8HCDR20
x TF8LCDR3.

17. A fragment of the CDR-grafted antibody of
Claim 1 wherein said fragment is capable of inhibiting
human tissue factor.

18. The fragment of Claim 17 wherein said
fragment is an Fab or F(ab')2 fragment.

19. A method of making the CDR-grafted
antibody of Claim 1 comprising cotransfecting a host
cell with an expression vector comprising a nucleic acid
encoding the CDR-grafted antibody heavy chain and an
expression vector comprising a nucleic acid encoding the
CDR-grafted antibody light chain; culturing the
transfected host cell; and recovering said CDR-grafted
antibody.

20. A method of making the CDR-grafted
antibody of Claim 1 comprising transfecting a host cell
with an expression vector comprising a nucleic acid
encoding the CDR-grafted antibody heavy chain and a
nucleic acid encoding the CDR-grafted antibody light
chain; culturing the transfected host cell; and
recovering said CDR-grafted antibody.

21. The method of Claim 18 or 19 wherein said
nucleic acid encoding the CDR-grafted antibody heavy
chain has the sequence of nucleotides 1-2360 of SEQ ID
NO: 15.

22. The method of Claim 18 or 19 wherein said
nucleic acid encoding the CDR-grafted light chain has
the sequence of nucleotides 1-759 of SEQ ID NO: 17.

23. The method of Claim 19 or 20 wherein said
host cell is a bacterial cell, yeast cell, insect cell
or mammalian cell.

24. The method of Claim 23 wherein said
mammalian cell is a CHO cell, COS cell or myeloma cell.

25. The method of Claim 19 wherein said
expression vector comprising a nucleic acid encoding the
CDR-grafted antibody heavy chain is pEe6TF8HCDR20.

26. The method of Claim 19 wherein said
expression vector comprising a nucleic acid encoding the
CDR-grafted antibody light chain is pEe12TF8LCDR3.

27. A nucleic acid encoding the heavy chain
of the CDR-grafted antibody of Claim 1.

28. A nucleic acid encoding the light chain
of the CDR-grafted antibody of Claim 1.

29. The nucleic acid of Claim 27 having the
sequence of nucleotides 1-2360 of SEQ ID NO: 15.

30. The nucleic acid of Claim 28 having the
sequence of nucleotides 1-759 of SEQ ID NO: 17.

31. A method of attenuation of coagulation
comprising administering a therapeutically effective
amount of a CDR-grafted antibody capable of inhibiting
human tissue factor to a patient in need of said
attenuation.

32. The method of Claim 31 wherein said CDR-grafted
antibody is TF8HCDR20 x TF8LCDR3.

33. A method of treatment or prevention of
thrombotic disorder comprising administering a
therapeutically effective amount of a CDR-grafted
antibody capable of inhibiting human tissue factor to a
patient in need of said treatment or prevention.

34. The method of Claim 33 wherein said
thrombotic disorder is intravascular coagulation,
arterial restenosis or arteriosclerosis.

35. The method of Claim 33 or 34 wherein said
CDR-grafted antibody is TF8HCDR20 x TF8LCDR3.

36. A pharmaceutical composition comprising
at least one CDR-grafted antibody capable of inhibiting
human tissue factor and a pharmaceutically acceptable
carrier.

37. The pharmaceutical composition of Claim
36 wherein said CDR-grafted antibody is TF8HCDR20 x
TF8LCDR3.
Sequence of the murine TF8-5G9 heavy chain cDNA with protein
translation. The essential regions of the cDNA are as follows:

Nucleotides   Region
1-10          5' untranslated region.
11-67         Start codon and leader sequence.
68-418        Variable region.
419-1390      Murine IgG1 constant region.
1391-1489     3' untranslated region.

Sequence Range: 1 to 1489
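
The region map for the heavy chain cDNA uses 1-based, inclusive nucleotide coordinates. As a quick consistency check, the following minimal sketch computes the length of each region and confirms that the regions cover the stated sequence range of 1 to 1489 without gaps or overlaps. The first two ranges are read here as 1-10 and 11-67, and the names `HEAVY_CHAIN_REGIONS`, `region_lengths` and `tiles_range` are illustrative assumptions, not part of the original disclosure.

```python
# Region map for the murine TF8-5G9 heavy chain cDNA, taken from the
# region table of this figure. Coordinates are 1-based and inclusive.
# All variable and function names here are illustrative assumptions.
HEAVY_CHAIN_REGIONS = [
    ("5' untranslated region", 1, 10),
    ("Start codon and leader sequence", 11, 67),
    ("Variable region", 68, 418),
    ("Murine IgG1 constant region", 419, 1390),
    ("3' untranslated region", 1391, 1489),
]
SEQUENCE_RANGE = (1, 1489)


def region_lengths(regions):
    """Return {name: length in nucleotides} for inclusive 1-based ranges."""
    return {name: end - start + 1 for name, start, end in regions}


def tiles_range(regions, seq_range):
    """Return True if the regions cover seq_range with no gaps or overlaps."""
    expected_start = seq_range[0]
    for _, start, end in regions:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == seq_range[1] + 1


if __name__ == "__main__":
    for name, length in region_lengths(HEAVY_CHAIN_REGIONS).items():
        print(f"{name}: {length} nt")
    print("contiguous:", tiles_range(HEAVY_CHAIN_REGIONS, SEQUENCE_RANGE))
```

Under these coordinates the variable region spans 351 nucleotides, which corresponds to 117 codons.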



           10            20           30           40
            *             *            *            *
GCT CCT TAC A ATG AAA TGC AGC TGG GTC ATC TTC TTC CTG ATG GCA GTG
CCA GGA ATG T TAC TTT ACG TCG ACC CAG TAG AAG AAG GAC TAC CGT CAC
              Met Lys Cys Ser Trp Val Ile Phe Phe Leu Met Ala Val>

50 60 70 80 90 



* 












• 








* 






• 






GTT 
CAA 
Val 


ACA 
TCT 
Thr 


GGG GTC AAT 
CCC CAG TTA 
Gly Val Asn 


TCA 
ACT 
Ser 


GAC 
CTC 
Glu 


ATT 
TAA 
lie 


CAC 
CTC 
Gin 


CTG 
GAC 
Leu 


CAG 
GTC 
Gin 


CAG 
GTC 
Gin 


TCT 
AGA 
Ser 


GGG CCT 
CCC CGA 
Gly Ala 


GAG 
CTC 
Glu> 


100 
• 






110 
* 




120 






130 






140 
* 




CTT 
GAA 
Leu 


CTG 
CAC 

val 


AGG 
TCC 
Ary 


CCA 
GGT 
Pro 


GGC 
CCC 
Gly 


CCC 
CCG 
Ala 


TTA 
AAT 
Leu 


GTC 
CAG 
Val 


AAG 

TTC 
Lys 


TTG 
AAC 
Leu 


TCC 
AGG 
Ser 


TGC 
ACC 
Cye 


AAA 

TTT 
Lye 


gct 

CCA 
Ala 


TCT 
AGA 
Ser 


GGC 
CCG 
Cly> 


150 
* 






160 
• 






170 
* 




180 






190 




TTC 
AAG 
Phe 


AAC 
TTC 
Ann 


ATT 
TAA 
lie 


AAA 
TTT 
Lys 


GAC 
CTC 
Aflp 


TAC 
ATG 
Tyr 


TAT 
ATA 
Tyr 


ATC 
TAC 
Met 


CAC 
GTG 
Hie 


TCC 
ACC 
Trp 


GTG 
CAC 
Val 


AAG 
TTC 
Lys 


CAG 
GTC 
Cln 


AGC 
TCC 
Ary 


* 

CCT 
GGA 
Pro 


CAA 
CTT 
Glu> 




200 
• 




210 
* 






220 
• 






230 




240 


CAG 
CTC 
Gin 


GGC 
CCG 
Gly 


CTC 
GAC 
Leu 


GAG 
CTC 
Clu 


TGG 
ACC 
Trp 


ATT 
TAA 
He 


GGA 
CCT 
Gly 


TTC 
AAC 
Leu 


ATT 
TAA 
He 


CAT 
CTA 
Aep 


CCT 
GGA 
Pro 


CAC 
CTC 
Glu 


AAT 
TTA 
Aen 


GCT 
CCA 
Gly 


AAT 
TTA 
Asn 


* 

ACT 
TGA 
Thr> 






250 
• 






260 
* 




270 






280 








ATA 
TAT 
Zle 


TAT 
ATA 
Tyr 


GAC 
CTG 
Asp 


CCC 
GGC 
Pro 


AAC 

TTC 
Lys 


TTC 
AAG 
Phe 


CAG 
CTC 
Gin 


GGC 
CCG 
Gly 


AAG 

TTC 
Lye 


CCC 
CCG 
Ala 


ACT 
TCA 
Ser 


ATA 
TAT 
He 


• 

ACA 
TGT 
Thr 


CCA 
CCT 
Ala 


GAC 
CTC 
Asp 


ACA 

TCT 
Thr> 


90 
* 




300 
* 






310 






320 




330 






TCC 
AGG 
Ser 


TCC 
AGG 
Sex 


AAC 
TTC 
Asa 


ACA 
TCT 
Thr 


CCC 
CCG 
Ala 


TAC CTG 
ATG GAC 
Tyr Leu 


CAG 
GTC 
Gin 


CTC AGC 
GAG TCG 
Leu Ser 


ACC 
TCC 
Ser 


CTC 
GAC 
Leu 


ACA 
TCT 
Thr 


w 

TCT 
AGA 
Ser 


GAG 
CTC 
Glu 


GAC 
CTG 
Asp> 
FIG. 1B



340 


350 
* 




360 
* 




370 
* 


ACT GCC GTC 
TGA CGG CAG 
Thr Ala Val 


TAT TAC 
ATA ATG 
Tyr Tyr 


TGT 
ACA 
Cys 


CCT AGA GAT 
CGA TCT CTA 
Ala Arg Asp 


AAC 
TTG 
Asn 


TCG TAC TAC 
AGC ATG ATG 
Ser Tyr Tyr 


390 
- * 


400 
* 




410 
* 




420 


TGG GGC CAA 
ACC CCG GTT 
Trp Gly Gin 


GGC ACC 
CCG TGG 
Gly Thr 


ACT 
TGA 
Thr 


CTC ACA GTC 
GAG TGT CAG 
Leu Thr Val 


TCC 
AGG 
Ser 


* 

TCA GCC AAA 
AGT CGG TTT 
Ser Ala Lys 



380 



440 450 460 470 



CCA TCT GTC TAT CCA CTG GCC 





* 




AAA 


GAC 


TAC 


AAA 


CTG 


ATG 


Phe 


Asp 


Tyr> 




430 






• 




ACG 


ACA 


ccc 


TGC 


TGT 


GGG 


Thr 


Thr 


Pro> 




480 

<* 


ACT 


AAC 


TCC 


TGA 


TTG 


AGG 


Thr 


Asn Ser> 



        490           500          510          520
ATG GTG ACC CTG GGA TGC CTG GTC AAG GGC TAT TTC CCT GAG CCA GTG
TAC CAC TGG GAC CCT ACG GAC CAG TTC CCG ATA AAG GGA CTC GGT CAC
Met Val Thr Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro Val>

530        540          550           560          570
*            *            *             *            *
ACA GTG ACC TGG AAC TCT GGA TCC CTG TCC AGC GGT GTG CAC ACC TTC
TGT CAC TGG ACC TTG AGA CCT AGG GAC AGG TCG CCA CAC GTG TGG AAG
Thr Val Thr Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe>

580           590          600          610           620
  *             *            *            *             *
CCA GCT GTC CTG CAG TCT GAC CTC TAC ACT CTG AGC AGC TCA GTG ACT
GGT CGA CAG GAC GTC AGA CTG GAG ATG TGA GAC TCG TCG AGT CAC TGA
Pro Ala Val Leu Gln Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr>

   630          640           650          660          670
     *            *             *            *            *
GTG CCC TCC AGC ACC TGG CCC AGC GAG ACC GTC ACC TGC AAC GTT GCC
CAC GGG AGG TCG TGG ACC GGG TCG CTC TGG CAG TGG ACG TTG CAA CGG
Val Pro Ser Ser Thr Trp Pro Ser Glu Thr Val Thr Cys Asn Val Ala>



680 690 
• * 

CAC CCG GCC AGC AGC ACC AAG GTC 
CTG GGC CGG TCG TCC TGG TTC CAC 
His Pro Ala Ser Ser Thr Lys Val 



700 710 720 

* » • 

GAC AAG AAA ATT GTG CCC AGC GAT 

CTC TTC- TTT TAA CAC GGG TCC CTA 

Asp Lys Lys lie Val Pro Arg Asp> 



730 740 

TCT GGT TGT AAC CCT TGC ATA TCT 
ACA CCA ACA TTC GGA ACG TAT ACA 
Cys Gly Cys Lys Pro Cys lie Cys 



750 760 
• * 

ACA GTC CCA GAA CTA TCA TCT GTC 
TGT CAG GGT CTT CAT AGT AGA CAG 
Thr Val Pro Glu Val Ser Ser Val> 



770 780 790 800 810 

• • • • • 

TTC ATC TTC CCC CCA AAG CCC AAG GAT CTC CTC ACC ATT ACT CTG ACT 
AAG TAG AAG GGC GCT TTC CGG TTC CTA CAC GAG TCG TAA TGA GAC TGA 
Phe lie Phe Pro Pro Lys Pro Lys Asp Val Leu Thr lie Thr Leu Thr> 
FIG. 1C



820 830 840 650 860 



* 

CCT AAG 
GO* TTC 
Pro Lys 


GTC 
CAG 
Val 


ACG 
TGC 
Thr 


* 

TCT 
ACA 
Cys 


CTT 
CAA 
Val 


GTC 
CAC 
Val 


* 

CTA 
CAT 
Val 


GAC 
CTG 
Asp 


ATC 
TAG 
lie 


* 

AGC 
TCG 
Ser 


AAC 

roc 

Lys 


GAT 
CTA 
Asp 


GAT 
CTA 
Asp 


* 

CCC GAG 
GGG CTC 
Pro Glu> 


870 
• 






880 
* 






890 
* 




900 






910 




CTC 
CAG 

Val 


CAG TTC 
CTC AAG 
Gin Phe 


AGC 
TCC 
Ser 


TGC 
ACC 
Trp 


TTT 
AAA 
Phe 


CTA 
CAT 
Val 


GAT 
CTA 
Asp 


GAT 
CTA 
Asp 


GTC 
CAC 
Val 


GAC 
CTC 
Glu 


CTG 
CAC 
Val 


CAC 
GTC 
His 


ACA 
TCT 
Thr 


* 

GCT 
CGA 
Ala 


CAG 
GTC 
Gln> 




920 
• 




930 
* 






940 
* 






950 
+ 




960 


ACG 
TGC 
Thr 


CAA 
CTT 
Gin 


CCC 
GGG 
Pro 


CGG 
CCC 
Arg 


GAG 
CTC 
Glu 


GAG 
CTC 
Glu 


CAG 
GTC 
Gin 


TTC 
AAG 
Phe 


AAC 
TTG 
Asn 


AGC 
TCG 
Ser 


ACT 
TGA 
Thr 


TTC CGC 
AAC CCG 
Phe Arg 


TCA 
ACT 
Ser 


GTC 
CAG 
Val 


m 

ACT 
TCA 
Ser> 






970 
* 






980 
* 




990 




1000 








GAA 
CTT 
Glu 


CTT 
GAA 
Lou 


CCC 
GGG 
Pro 


ATC 
TAG 
He 


ATG 
TAC 
Met 


CAC 
GTC 
His 


CAG 
CTC 
Gin 


GAC 
CTC 
Asp 


TCC 
ACC 
Trp 


CTC 
GAG 
Leu 


AAT 
TTA 
Asn 


GGC 
CCG 
Gly 


w 

AAG 

roc 
Lys 


GAG 

CTC 
Glu 


roc 

AAC 
Phe 


AAA 

TTT 
Lys> 


HO 
• 




1020 
• 




1030 
• 






1040 




1050 






TGC 
ACG 
Cye 


AGG 
TCC 
Ary 


GTC 
CAG 
Val 


AAC 
TTG 
Asn 


ACT 
TCA 
Ser 


CCA 
CCT 
Ala 


GCT 
CCA 
Ala 


roc 

AAC 
Phe 


CCT 
GGA 
Pro 


GCC 
CGG 
Ala 


CCC 
GGG 
Pro 


ATC 
TAG 
He 


GAG 
CTC 
Glu 


w 

AAA 
TTO 
Lys 


ACC 
TCC 
Thr 


ATC 
TAG 
Xle> 


1060 
* 






1070 
* 




1080 
* 




1090 






1100 




TCC 
AGG 
Ser 


AAA 

TTT 
Lya 


ACC 
TGC 
Thr 


AAA GGC 
TTT CCG 
Lys Gly 


AGA 
TCT 
Ary 


CCC 
GGC 
Pro 


AAG 

roc 

Lys 


GCT 
CCA 
Ala 


CCA 
GCT 
Pro 


CAG 
GTC 
Gin 


GTC 
CAC 
Val 


TAC 
ATG 
Tyr 


ACC 
TGC 
Thr 


* 

ATT 
TAA 
He 


CCA 
GCT 
Pro> 


1110 
• 




1120 
• 






1130 




1140 




1150 




CCT 
GGA 
Pro 


CCC 
GGG 
Pro 


AAG 
TTC 
Lye 


GAG 
CTC 
Glu 


CAG 
GTC 
Gin 


ATG 
TAC 
Met 


CCC 
CGG 
Ala 


AAG 
TTC 
Lye 


GAT 
CTA 
Asp 


AAA 
TTT 
Lys 


CTC 
CAG 
Val 


ACT 
TCA 
Ser 


CTC 
GAC 
Leu 


ACC 
TCC 
Thr 


• 

TGC 
ACG 
Cys 


ATG 
TAC 
Met> 




1160 
* 




1170 
• 




1180 
• 






1190 




1200 


ATA 
TAT 
lie 


ACA GAC 
TGT CTG 
Thr Asp 


TTC 
AAG 
Phe 


TTC 
AAC 
Phe 


CCT 
GGA 
Pro 


GAA 

CTT 
Glu 


GAC 
CTC 
Asp 


ATT 
TAA 
lie 


ACT 
TGA 
Thr 


CTC 
CAC 
Val 


GAC 
CTC 
Glu 


TCG 
ACC 
Trp 


CAC 
CTC 
Gin 


TCG 
ACC 
Trp 


• 

AAT 
TTA 
Asn> 




1210 
• 






1220 




1230 
* 




1240 








GGG 
CCC 
Gly 


CAG 
CTC 
Gin 


CCA 
GCT 
Pro 


CCG 
CCC 
Ala 


GAG 
CTC 
Glu 


AAC 
TTC 
Asn 


TAC 
ATC 

Tyr 


AAG 
TTC 
Lys 


AAC ACT 
TTC TGA 
Asn Thr 


CAG 
GTC 
Gin 


CCC 
GGG 
Pro 


w 

ATC 
TAG 
He 


ATC 
TAC 
Met 


GAC 
CTC 
Asp 


ACA 
TCT 
Thr> 


150 
• 




1260 
• 




1270 
* 






1280 




1290 






GAT 
CTA 
Asp 


GGC TCT TAC 
CCG AGA ATG 
Gly Ser Tyr 


TTC 
AAC 
Phe 


CTC 
CAG 
Val 


TAC 
ATG 
Tyr 


ACC 
TCC 
Ser 


AAC 

roc 
Lys 


CTC 
GAC 
Leu 


AAT 
TTA 
Asn 


CTG 
CAC 
Val 


CAG 
GTC 
Gin 


w 

AAG 

roc 

Lys 


AGC 
TCG 
Ser 


AAC 

roc 
Asn> 
FIG. 1D

1300 1310 1320 1330 1340 

* • * « • 

TGG GAG GCA CCA AAT ACT TTC ACC TGC TCT GTC TTA CAT GAG GGC CTG 
ACC CTC CCT CCT TTA TCA AAC TOO ACC AGA CAC AAT CTA CTC CCC GAC 
Trp Glu Ala Gly Ara Thr Phe Thr CyB Sex Val Leu Hie Clu Cly Leu> 



1350 
* 






1360 






1370 
* 




CAC AAC 
CTG TTG 
Hie Aim 


CAC 
GTG 
His 


CAT 
CTA 
HiB 


ACT 
TGA 
Thr 


GAG 
CTC 
GlU 


AAG 
TTC 
Ly8 


AGC CTC 
TCC GAG 
Ser Leu 


TCC 
AGG 
Ser 


1400 
* 




1410 






1420 




CCA GTG 
GGT CAC 


TCC 
AGG 


TTG 
AAC 


GAG 
CTC 


CCC 
GGG 


TCT GGT CCT ACA 
AGA CCA GGA TGT 


1450 




1460 




1470 




CCA CCC 
GGT GGG 


CTC 
GAG 


CCT 
GGA 


GTA 

CAT 


TAA 
ATT 


ATA 
TAT 


AAG CAC 
TTC GTC 


CCA 
GGT 



1380 1390 



1430 1440 



1480 
FIG. 2A

Sequence of the murine TF8-5G9 light chain cDNA with protein
translation. The essential regions of the cDNA are as follows:

Nucleotides   Region
1-4           5' untranslated.
5-64          Start codon and leader sequence.
65-385        Variable region.
386-706       Murine kappa constant region.
707-917       3' untranslated region.
918-937       Poly A tail.

Sequence Range: 1 to 937
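
The light chain coordinates are likewise 1-based and inclusive, so extracting a region from a sequence string requires converting each range to a 0-based, half-open slice. The sketch below shows that conversion and checks that the three translated regions (leader, variable, kappa constant) are each a whole number of codons. The dictionary keys, the helper names `to_slice` and `codon_count`, and the placeholder sequence are assumptions made for illustration; the actual 937-nucleotide sequence would be substituted for the placeholder.

```python
# Light chain cDNA region map from the region table of this figure.
# Coordinates are 1-based and inclusive, as printed.
# Dictionary keys and helper names are illustrative assumptions.
LIGHT_CHAIN_REGIONS = {
    "5' untranslated": (1, 4),
    "leader": (5, 64),
    "variable": (65, 385),
    "kappa constant": (386, 706),
    "3' untranslated": (707, 917),
    "poly A tail": (918, 937),
}


def to_slice(start, end):
    """Convert a 1-based inclusive range into a 0-based half-open slice."""
    return slice(start - 1, end)


def codon_count(start, end):
    """Return the number of codons in a range; raise if it is out of frame."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range {start}-{end} is not a whole number of codons")
    return length // 3


if __name__ == "__main__":
    # Placeholder of the stated length; not the real sequence.
    cdna = "N" * 937
    for name in ("leader", "variable", "kappa constant"):
        start, end = LIGHT_CHAIN_REGIONS[name]
        segment = cdna[to_slice(start, end)]
        print(name, len(segment), "nt,", codon_count(start, end), "codons")
```

With the ranges as listed, the leader, variable and kappa constant regions come to 20, 107 and 107 codons respectively.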



10 20 30 40 

* * • * 

GGA C ATC CCG GCC CCT CCT CAG TTT TIT GGC ATC TTG TTG CTC TCC TTT 
CCT G TAC GCC CGC GGA CGA GTC AAA AAA CCC TAG AAC AAC GAG ACC AAA 
Met Ary Ala Pro Ala Gin Phe Phe Gly He Leu Leu Leu Trp Phe> 



50 60 70 60 90 









# 






• 








• 






* 






CCA 


GGT 


ATC 


AGA 


TCT 


GAC 


ATC 


AAG 


ATC 


ACC 


CAG 


TCT 


CCA 


TCC 


TCC 


ATG 


OCT 


CCA 


TAG 


TCT 


ACA 


CTG 


TAG 


TTC 


TAC 


TGG 


GTC 


ACA 


GGT 


ACC 


ACC 


TAC 


Pro 


Gly 


He 


Ary 


Cye 


Asp 


lie 


Lye 


Met 


Thr 


Gin 


Ser 


Pro 


Ser 


Ser 


Met> 



100 
* 






110 
* 




120 






130 

• 




140 




TAT 
ATA 
Tyr 


GCA 
CCT 
Ala 


TCG 
AGC 
Ser 


CTG 
GAC 
Leu 


GGA 
CCT 
Cly 


GAC 
CTC 
Glu 


AGA 

TCT 
Ary 


CTC 
CAG 

Val 


ACT 
TCA 
Tnr 


ATC 
TAG 
lie 


ACT 
TGA 
Thr 


TCT 
ACA 
Cye 


AAC CCC 
TTC CCC 
Lye Ala 


ACT 
TCA 
Ser 


CAG 
CTC 
Gln> 


150 
• 






160 
• 






170 
* 




180 




190 




GAC 
CTG 
Asp 


ATT 
TAA 
lie 


AGA 
TCT 
Ary 


AAC 
TTC 
Lye 


TAT 
ATA 

Tyr 


TTA 
AAT 
Leu 


AAC 
TTG 
Abo 


TGG 
ACC 
Trp 


TAC 
ATG 
Tyr 


CAG 
GTC 
Gin 


CAG 
CTC 
Gin 


AAA 
TTT 
Lye 


CCA TGG 
GGT ACC 
Pro Trp 


• 

AAA 
TTT 
Lye 


TCT 
ACA 
Ser> 




200 
• 




210 
• 






220 

• 






230 


240 


CCT 
GGA 
Pro 


AAC 
TTC 
Lye 


ACC 
TGG 
Thr 


CTC 
GAC 
Leu 


ATC 
TAC 
He 


TAT 
ATA 

Tyr 


TAT 
ATA 
Tyr 


GCA 
CCT 
Ala 


ACA 
TGT 
Thr 


ACC 
TCC 
Ser 


TTC 
AAC 
Leu 


CCA 
CCT 
Ala 


GAT CGC 
CTA,CCC 
Asp Cly 


CTC 
CAC 
Val 


CCA 
GGT 
Pro> 






250 
• 






260 
• 




270 
• 






280 






TCA 
ACT 
Ser 


ACA 

TCT 
Ary 


TTC 
AAG 
Phe 


ACT 
TCA 
Ser 


CCC 
CCG 
Gly 


ACT GGA 

TCA CCT 
Ser Cly 


TCT 
AGA 
Ser 


CCG CAA 
CCC GTT 
Cly Gin 


GAT 
CTA 
Aep 


TAT 
ATA 
Tyr 


TCT CTA 
ACA GAT 
Ser Leu 


ACC 
TCG 
Thr 


ATC 
TAG 
Ile> 


90 
• 




300 
• 






310 
* 






320 
* 




330 
* 






AGC 
TCG 
Sex 


ACC 
TCG 
Ser 


CTC 
GAC 
Leu 


GAC 
CTC 
Glu 


TCT 
AGA 
Ser 


GAC GAT 
CTG CTA 
Asp Asp 


ACA 
TCT 
Thr 


GCA ACT TAT 
CCT TGA ATA 

Ala Thr Tyr 


TAC 
ATG 

Tyr 


TGT CTA 
ACA GAT 
Cys Leu 


CAA 
GTT 
Gin 


CAT 
GTA 
K±fl> 
FIG. 2B

340 350 360 370 380 

* - * * » * 

CGT GAG ACC CCG TAC ACG TTC GGA GGG GGG ACC AAG CTC GAA ATA AAC 
CCA CTC TCG GGC ATG TGC AAG CCT CCC CCC TGG TTC GAC CTT TAT TTG 
Gly Glu Ser Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu He Asn> 

390 400 410 420 430 

* * * * * 

AGG CCT GAT CCT GCA CCA ACT GTA TCC ATC TTC CCA CCA TCC ACT GAG 
TCC CGA CTA CGA CGT GGT TGA CAT AGG TAG AAG GGT GGT AGG TCA CTC 
Arg Ala Asp Ala Ala Pro Thr Val Ser He Phe Pro Pro Ser Ser Glu> 

440 450 460 470 480 



CAG TTA 
GTC AAT 
Gin Leu 


* 

ACA 
TGT 
Thr 


TCT 
AGA 
Ser 


* 

GGA GGT 
CCT CCA 
Gly Gly 


GCC 
CGG 
Ala 


* * 

TCA GTC CTG TGC TTC TTG 
ACT CAG CAC ACG AAG AAC 
Sex Val Val Cys Phe Leu 


AAC 
TTG 
Asn 


* 

AAC TTC 
TTG AAG 
Asn Phe> 






490 
* 






500 
* 


510 520 
* • 






TAC CCC 
ATG GGG 
Tyr Pro 


AAA 

TTT 
Lys 


GAC 
CTG 
Asp 


ATC 
TAG 
He 


AAT 
TTA 
Asn 


GTC 
CAG 
Val 


AAG TGG AAG ATT GAT GGC 
TTC ACC TTC TAA CTA CCC 
Lys Trp Lys He Asp Gly 


ACT 
TCA 
Ser 


GAA CGA 
CTT GCT 
Glu Arg> 


30 
* 




540 
* 






550 
• 


560 570 




CAA 
CTT 
Gin 


AAT 
TTA 
Asn 


GGC 
CCC 
Gly 


GTC 
CAG 

val 


CTC 
GAC 
Leu 


AAC 
TTG 
Asn 


ACT TGG ACT GAT CAC GAC AGC 
TCA ACC TGA CTA GTC CTG TCG 
Ser Trp Thr Asp Gin Asp Ser 


AAA 

TTT 
Lys 


GAC ACC 
CTG TCG 
Asp Ser> 


580 
• 






590 
• 




600 610 
* * 


620 
* 


ACC TAC 
TGG ATC 
Thr Tyr 


AGC 
TCC 
Ser 


ATG 
TAC 
Met 


ACC 
TCG 
Ser 


AGC 
TCC 
Ser 


ACC 
TGG 
Thr 


CTC ACG TTG ACC AAG CAC 
GAG - TGC AAC TGG TTC CTC 
Leu Thr Leu Thr Lys Asp 


GAG 
CTC 
Glu 


TAT GAA 
ATA CTT 
Tyr Glu> 


630 
* 






640 






650 660 
* * 




670 
* 


CGA 
GCT 
AT* 


CAT 
GTA 

His 


AAC 
TTG 
Asn 


ACC 
TCC 
Ser 


TAT ACC 
ATA TCG 
Tyr Thr 


TCT 
ACA 
Cys 


GAG GCC ACT CAC AAG ACA 
CTC CGG TGA CTG TTC TCT 
Glu Ala Thr His Lys Thr 


TCA 
ACT 
Ser 


ACT TCA 
TCA ACT 
Thr Ser> 




680 
• 




690 
• 




700 710 
* * 




720 


CCC 
GGC 
Pro 


ATT 
TAA 
lie 


CTC 
CAG 
Val 


AAG 
TTC 
Lye 


AGC 
TCC 
Ser 


TTC 
AAG 
Phe 


AAC 
TTC 
Asn 


AGG AAT GAG TCT TA GAG ACA AAG CTC CTC 
TCC TTA CTC ACA AT CTC TCT TTC CAG GAC 
Arg Asn Glu Cys> 




730 
• 




740 
• 




750 760 
• * 




770 


AGA 

TCT 


CGC 
GCG 


CAC 
GTC 


CAC 
CTC 


CAG 
GTC 


CTC 
GAG 


CCC 
GGG 


AGC TCC ATC CTA TCT TCC 
TCG AGC TAC GAT AGA AGG 


CTT 
GAA 


# 

CTA AGG 
GAT TCC 






780 
* 






790 
* 


800 aio 
* • 






TCT 
AGA 


TCG 
ACC 


AGG 
TCC 


CTT 
GAA 


CCC 
GGC 


CAC 
CTC 


AAG 

TTC 


CGA CCT ACC ACT CTT GCC 
CCT GGA TCG TCA CAA CGC 


CTG 
CAC 


CTC CAA 
CAG CTT 
FIG. 2C



820 



830 



840 
* 



850 



860 



ACC TCCTCCCCXCCTCCTTCTCCTCCrCCrCCCTTr CCT TCG CTT TTX 
TKAGGAGCGCTC^craAGAGaACrac^cra 



870 



880 



890 



900 



910 



TCA TGC TAX TAT TTG CAG AAA ATA TOC AAT AAA GTG AGT CTT TCC ACT 
ACT ACG ATT ATA AAC GTC TTT TAT AAC TTA TTT CAC TCA GAA ACC TGA 



920 



930 



TGA AAA AAA AAA AAA AAA AAA A 
ACT TTT TTT TTT TTT TTT TTT T 
FIG. 4A

The pEe6TF8HCDR20 expression vector DNA sequence. The
coding regions of the TF8-5G9 CDR-grafted HC gene, TF8HCDR20,
are translated.



Sequence Range: 1 to 7073 







10 
* 






20 
* 






30 
* 






40 
* 






GAA TTC 
CTT AAG 


GCC 
CGG 


GCC 
CGG 


ACC 
TGG 


ATG 
TAC 
Met 


GAA TGG AGC 
CTT ACC TCG 
Glu Trp Ser 


TGG 
ACC 
Trp 


GTC 
CAG 
Val 


TOT 
AAA 
Phe 


CTC 
GAG 
Leu 


TTC 
AAG 
Phe 


TTC 
AAG 
Phe 


TTG 
AAC 
Leu> 


50 
* 






60 
* 






70 
* 






00 
* 






90 
* 






TCA 
ACT 
Ser 


GTA 
CAT 
Val 


ACT ACA GGT 
TGA TGT CCA 
Thr Thr Gly 


GTA 
CAT 
Val 


CAC 
GTG 
Hie 


TCA CAA 
ACT GTT 
Ser Gin 


GTT 
CAA 
Val 


CAC 
GTC 
Gin 


CTG 
GAC 
Leu 


GTG 
CAC 
Val 


GAG 
CTC 
Glu 


TCT 
AGA 
Ser 


GGA 
CCT 
Gly> 


100 
* 




110 
* 






120 
* 






130 




140 




GGA 
CCT 
Gly 


GGA 
CCT 
Gly 


GTA 
CAT 
Val 


GTA 
CAT 
Val 


CAA 
GTT 
Gin 


CCT 
GGA 
Pro 


GGA 
CCT 
Gly 


AGG 
TCC 
Ary 


TCA 
ACT 
Ser 


CTG 
GAC 
Leu 


AGA 
TCT 
Ary 


CTC 
GAC 
Leu 


TCT 
AGA 
Ser 


TGT 
ACA 
Cys 


w 

AAG 

TTC 
Lye 


GCT 
CGA 
Ala> 




150 
• 






160 
* 




170 
* 






180 






190 


ACT 
TCA 
Ser 


GGA 
CCT 
Gly 


TTC 
AAG 
Phe 


AAT 
TTA 
Asn 


ATC 
TAG 
lie 


AAG 

TTC 
Lye 


GAC 
CTC 
Asp 


TAT 
ATA 
Tyr 


TAT 


ATC 
TAC 
Met 


CAC 
GTC 
His 


• 

TCG 
ACC 
Trp 


GTC 
CAG 
Val 


AGA 
TCT 
Ary 


CAA 
GTT 
Gin 


• 

GCT 
CGA 
Ala> 




200 
• 






210 
* 






220 
• 




230 






240 


CCT 
GGA 
Pro 


GGA 
CCT 
Gly 


AAA 
TTT 
Lye 


GGA 
CCT 
Gly 


CTC 
GAG 
Leu 


GAG 
CTC 
Glu 


TCG 
ACC 
Trp 


ATA 
TAT 
He 


GGT 
CCA 
Gly 


TTA 
AAT 
Leu 


ATT 
TAA 
He 


GAT 
CTA 
Asp 


CCT 
GGA 
Pro 


GAG 
CTC 
Glu 


AAT 
TTA 
Asn 


# 

GGT 
CCA 
Gly> 






250 
• 




260 
• 






270 
• 






280 






AAC 
TTG 
Asn 


ACG 
TGC 
Thr 


ATA 
TAT 
He 


TAT 
ATA 
Tyr 


GAT 
CTA 
Asp 


ccc 

GGG 
Pro 


AAG 
TTC 
Lye 


TTC 
AAG 
Phe 


CAA 
GTT 
Gin 


GGA 
CCT 
Gly 


AGA 
TCT 
Ary 


TTC 
AAG 
Phe 


ACA 
TGT 
Thr 


w 

ATT 
TAA 
lie 


TCT 
AGA 
Ser 


GCA 
CCT 
Ala> 


290 
* 






300 
• 






310 
* 




320 






330 






GAC 
CTG 
Asp 


AAC 
TTG 
Asn 


TCT 
AGA 
Ser 


AAG 
TTC 
Lye 


AAT 
TTA 

Asn 


ACA 
TCT 
Thr 


CTG 
GAC 
Leu 


TTC 
AAG 
Phe 


CTC 
GAC 
Leu 


CAG 
CTC 
Gin 


ATC 
TAC 
Met 


GAC 
CTG 
Asp 


TCA 
ACT 
Ser 


* 

CTC 
GAG 
Leu 


AGA 
TCT 
Ary 


CCT 
GGA 
Pro> 


340 
• 




350 
* 






360 
• 






370 




380 




GAG 
CTC 
Glu 


GAT 
CTA 
Afip 


ACA 
TCT 
Thr 


CCA 
CCT 
Ala 


GTC 
CAC 
Val 


TAC 
ATG 
Tyr 


TAT 
ATA 
Tyr 


TGT 
ACA 
Cys 


CCT 
CCA 
Ala 


AGA 
TCT 
Ary 


GAT 
CTA 
Asp 


AAC 
TTC 
Asn 


ACT 
TCA 
Ser 


* 

TAT TAC 
ATA ATG 
Tyx Tyr 


TTC 
AAG 
Phe> 



390 400 410 420 430 



GAC TAC TGG GGC CAA GGA ACA CCA GTC ACC GTG ACC TCA GCT TCC ACC 
CTG ATC ACC CCC GTT CCT TGT GGT CAC TCG CAC TCC ACT CGA ACG TGG 
Asp Tyr Trp Gly Gin Gly Thr Pro Val Thr Val Sex Ser Ala Ser Thr> 
FIG. 4B



440 450 

AAG GCC CCA TCC GTC TTC CCC CTG 
TTC CCG CGT AGC CAG AAG GGG GAC 
Lya Gly Pro Ser Val Phe Pro Leu 



460 470 480 

* * * 

GCG CCC TGC TCC AGG AGC ACC TCC 
CGC GGG ACG AGG TCC TCC TGG AGC 
Ala Pro Cya Ser Arg Ser Thr Ser> 



490 500 510 520 

* * * * 

GAG AGC ACA GCC GCC CTG GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA 
CTC TCG TGT CGG CGG GAC CCG ACG GAC CAG TTC CTG ATG AAG GGG CTT 
Glu Ser Thr Ala Ala Leu Gly Cya Leu Val Lya Asp Tyr Phe Pro Glu> 

530 540 550 560 570 

* * • * * 

CCG GTG ACG GTG TCG TGG AAC TCA GGC GCC CTG ACC AGC GGC GTG CAC 
GGC CAC TGC CAC AGC ACC TTG ACT CCG CGG GAC TGG TCG CCG CAC GTG 
Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val Hia> 

560 590 600 610 620 

* * * • • 

ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA CTC TAC TCC CTC AGC ACC 
TGG AAG GGC CGA CAG GAT GTC AGG ACT CCT GAG ATG AGG GAG TCG TCG 
Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser> 

630 640 650 660 670 

* * * * * 

GTG CTG ACC GTG CCC TCC AGC AGC TTG GGC ACG AAG ACC TAC ACC TGC 
CAC CAC TGG CAC GGG AGG TCG TCG AAC CCG TCC TTC TGG ATG TGG ACG 
Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Lya Thr Tyr Thr Cya> 

600 690 700 710 720 

* * * * * 

AAC CTA GAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AGA CTT GGT 
TTG CAT CTA GTC TTC GGG TCG TTG TGG TTC CAC CTG TTC TCT CAA CCA 
Aan Val Aap Hie Lya Pro Ser Asn Thr Lya Val Asp Lya Ary Val> 

730 740 750 760 

• * • * 

GAC AGG CCA GCA CAG GGC AGG GAG CGT GTC TGC TGG AAG CCA GGC TCA 
CTC TCC GGT CCT CTC CCG TCC CTC CCA CAG ACC ACC TTC CGT CCG ACT 

770 780 790 800 810

• * • • • 

GCC CTC CTC CCT GGA CGC ACC CCG GCT GTG CAG CCC CAG CCC AGG GCA 
CGC GAG GAC CGA CCT CCC TCC GGC CGA CAC CTC GGG GTC GGG TCC CCT 

820 830 840 850 860 

• • * * * 

GCA AGG CAT GCC CCA TCT CTC TCC TCA CCC GGA GGC CTC TGA CCA CCC 
CGT TCC CTA CGG GGT AGA CAG AGG ACT GGG CCT CCG GAG ACT GGT CGC 

870 880 890 900 910 

• • * * * 

CAC TCA TGC TCA GGG AGA GGG TCT TCT CGA TTT TTC CAC CAG GCT CCC 
CTG ACT ACG ACT CCC TCT CCC AGA AGA CCT AAA AAG GTG GTC CGA GGC 
FIG. 4 C

920 930 940 950 960 

• • * * * 

CCC AGC CAC AGG CTC GAT CCC CCT ACC CCA CCC COT CCG CAT ACA GGG 
CCC TCC CTC TCC CAC CPA CCC GGA TCC CCT CCC CCA CCC GTA TGT CCC 



970 980 
* * 

CCA GGT CCT CCG CTC AGA CCT CCC 
CCT CCA CCA CGC GAG TCT CCA CGG 

1010 1020 1030 

# * * 

CCC CCT GAC CTA ACC CCA CCC CAA 
CCG GGA CTC GAT TCG GCT CGG GTT 

1060 1070 1080 

* * * 

CTC AGA CAC CTT CTC TCC TCC CAC 
GAG TCT GTG GAA GAG AGG AGG CTC 



990 1000 

AAG ACC CAT ATC CCC GAG GAC CCT 
TTC TCG GTA TAG CCC CTC CTC GGA 

1040 1050 
* * 
AGG CCA AAC TCT CCA CTC CCT CAG 
TCC GGT TTG AGA GGT GAG GGA GTC 

1090 1100

ATT CCA GTA ACT CCC AAT CTT CTC 
TAA GCT CAT TGA CGG TTA GAA GAG 



1110 

* 






1120 
* 




1130 
* 


1140 






1150 


TCT 


CCA 


GAG 


TCC 


-AAA TAT 


GGT 


CCC 


CCA TGC 


CCA 


TCA 


TGC 


CCA 


* 

CCT AAG 


AGA 


CCT 


CTC 


AGG 


TIT ATA 


CCA 


GGG 


GGT ACC 


CCT 


ACT 


ACC 


GCT 


CCA TTC 






Glu


Ser Lys Tyr 


Gly 


Pro 


Pro Cys


Pro 


Ser 


Cys 


Pro>




1160 
* 




1170 
* 






1180 




1190 




1200 


CCA 


ACC 


CAC 


CCC 


TCC CCC 


TCC 


AGC 


TCA AGG 


CGG 


GAC 


AGG 


TGC 


* 

CCT AGA 


GCT 


TCG 


CTC 


CGC 


ACC CGG 


AGG 


TCC 


ACT TCC 


CCC 


CTC 


TCC 


ACC 


GGA TCT 



1210 1220 1230 1240 



GTA CCC TCC ATC CAG CCA CAG CCC CCA CCC CGG TCC TGA CGC ATC CAC 
CAT CGG ACC TAG GTC CCT CTC CCG GGT CGG CCC ACC ACT CCG TAG GTC 

1250 1260 1270 1280 1290



Pro Glu Phe Leu Gly Gly Pro Ser Val Phe>
1310 1320 1330 1340 



CTC 


CAT 


CTC 


GAC 


CTA 


GAG 


1300 
• 






CTG 


TTC 


CCC 


GAC 


AAC 


GGG 


Leu 


Phe 


Pro 



1350 1360 1370 1380 1390 





* 






• 




* 






• 




• 




GAC 


GTC 


ACC 


TCC 


CTC CTG 


CTG 


GAC GTG 


AGC 


CAC 


GAA GAC 


CCC 


GAG 


CTC 


CTC 


CAC 


TCC 


ACC 


CAC CAC 


CAC 


CTC CAC 


TCC 


CTC 


CTT CTG 


GGG 


CTC 


CAC 


Glu 


Val 


Thr 


Cys 


Val Val 


Val 


Asp Val 


Ser 


Gln Glu Asp


Pro 


Glu


Val> 




1400 
* 




1410 
* 




1420 
* 






1430 
• 




1440 


CAG 


TTC 


AAC 


TCG 


TAC CTG 


GAT 


GGC CTG 


GAG 


GTG 


CAT AAT 


GCC 


AAC 


ACA 


GTC 


AAG 


TTC 


ACC 


ATC CAC 


CTA 


CCC CAC 


CTC 


CAC 


CTA TTA 


CCG 


TTC 


TCT 


Gln


Phe 


Asn


Trp 


Tyr Val 


Asp


Gly Val


Glu 


Val 


His Asn


Ala 


Lys 


Thr> 
FIG. 4 D



1450 1460 1470 1480







• 


• 




• 




• 






AAG 
TTC 
Lys


CCC CGC GAG GAG 
GCC GCC CTC CTC 
Pro Arg Glu Glu 


CAG TTC AAC 
GTC AAG TTG 
Gln Phe Asn


AGC 
TCG 
Ser 


ACQ TAC 
TGC ATG 
Thr Tyr 


CGT 
GCA 
Arg 


GTG 
CAC 
Val 


GTC AGC 
CAC TCC 
Val Ser 


CTC 
CAC 
Val> 


1490 
• 




1500 
* 


1510 
• 




1520 
* 




1530 
* 




CTC 
GAG 
Leu 


ACC 
TGG 
Thr 


GTC CTG CAC 
CAG GAC GTG 
Val Leu His 


CAG GAC TGG 
GTC CTG ACC 
Gln Asp Trp


CTG 
GAC 
Leu 


AAC GGC 
TTG CCG 
Asn Gly 


AAG 
TTC 
Lys 


GAG 
CTC 
Glu 


TAC AAG 
ATG TTC 

Tyr Lys 


TGC 
ACC 
Cys> 


1540 
* 




1550 
* 


1560 
* 




1570 
* 






1580 
* 




AAG 
TTC 
Lys


GTC 
GAG 
Val 


TCC AAC AAA 
AGG TTG TTT 
Ser Asn Lys


GGC CTC CCG 
CCG GAG GGC 
Gly Leu Pro 


TCC 
AGG 
Ser 


TCC ATC 
AGG TAG 
Ser Ile


GAG 
CTC 
Glu 


AAA 
TTT 
Lys 


ACC ATC 
TGG TAG 
Thr Ile


TCC 
AGG 
Ser> 



1590 1600 1610 1620 1630 

* • * # * 

AAA GCC AAA GC TGG GAC CCA CCG GGT GCC AGG GCC ACA TCG ACA GAG GTC 
TTT CGG TTT CC ACC CTG GGT GCC CCA CGC TCC CGC TGT ACC TGT CTC CAG 
Lys Ala Lys>

1640 1650 1660 1670 1680 

* « # * * 

ACC TCC GCC CAC CCT CTG CCC TGC GAG TGA CCC CTC TGC CAA CCT CTC 
TCC ACC CGG GTC GGA GAC GCG ACC CTC ACT GGC GAC ACG GTT GGA GAC 

1690 1700 1710 1720 1730 

• # * * * 

TCC CTA CA GCG CAG CCC CGA GAG CCA CAG GTG TAC ACC CTG CCC CCA TCC 
AGG GAT CT CCC GTC GGG CCT CTC GGT GTC CAC ATG TCG GAC GGG GGT AGG 
Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser>





1740 
• 




1750 
* 


1760 
* 






1770 
* 


1780 
• 


CAG 
GTC 
Gln


GAG 

CTC 
Glu 


CAG 
CTC 
Glu 


ATG 
TAC 
Met 


ACC 
TGG 
Thr 


AAG AAC 
TTC TTC 
Lys Asn 


CAG 
CTC 
Gln


GTC 
CAG 
Val 


AGC 
TCG 
Ser 


CTG ACC TCC CTG 
GAC TCG ACC GAC 
Leu Thr Cys Leu 


GTC AAA 
CAG TTT 
Val Lys> 




1790 
* 






1800 
• 




1810 




1820 
* 




GGC 
CCC 
Gly


TTC 
AAG 
Phe 


TAC 
ATC 
Tyr 


CCC 
GGG 
Pro 


AGC 
TCG 
Ser 


GAC ATC 
CTG TAG 
Asp Ile


GCC 
CCG 
Ala 


GTC 
CAC 
Val 


GAG 

CTC 
Glu 


TGC GAG AGC AAT 
ACC CTC TCC TTA 
Trp Glu Ser Asn 


GGG CAG 
CCC GTC 
Gly Gln> 


1830 
• 




1840 




1850 
* 






1860 
* 


1870 
* 




CCG 
GGC 
Pro 


GAG 
CTC 
Glu 


AAC 
TTC 
Asn 


AAC 
TTC 
Asn 


TAC 
ATC 
Tyr 


AAG ACC 
TTC TCG 
Lys Thr 


ACG 
TCC 
Thr 


CCT 
GGA 
Pro 


CCC 
CGC 
Pro 


GTC 
CAC 
Val 


CTG GAC TCC 
GAC CTC AGG 
Leu Asp Ser 


GAC GGC 
CTG CCG 
Asp Gly> 


1880 
• 






1890 
* 


1900 
* 




1910 
* 


1920 
* 


TCC 
AGG 
Ser 


TTC 
AAG 
Phe 


TTC 
AAC 
Phe 


CTC 
CAG 
Leu 


TAC 
ATC 

Tyr 


AGC AGG 
TCC TCC 
Ser Arg


CTA 
GAT 
Leu 


ACC 
TGC 
Thr 


CTG 
CAC 
Val 


GAC 
CTC 
Asp 


AAC AGC AGG 
TTC TCG TCC 
Lys Ser Arg


TGG CAG 
ACC GTC 
Trp Gln> 
FIG. 4 E



1930 1940 1950 1960 1970 

• # * * * 

GAG GGG AAT CTC TTC TCX TCC TCC CTG ATC CAT GAG GCT CTG CAC AAC 
CTC CCC TTA CAG AAG ACT ACG AGG CAC TAC GTA CTC CCA GAC GTG TTC 
Glu Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn>

1980 1990 2000 2010 2020 

* • • * * 

CAC TAC ACA CAG AAG AGC CTC TCC CTG TCT CTC GGT AAA T GAG TGC CAG 
GTG ATG TGT GTC TTC TCG GAG AGG GAC AGA GAC CCA TTT A CTC ACG CTC 
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Leu Gly Lys Xxx>

2030 2040 2050 2060 2070 

* • * * * 

GGC CGG CAA GCC CCC GCT CCC CGG GCT CTC GGG GTC GCC CGA GGA TGC 
CCG GCC GTT CGG GGG CGA GGG GCC CGA GAG CCC CAG CGC GCT CCT ACG 

2080 2090 2100 2110

* * * * 

TTG GCA CCT ACC CCG TCT ACA TAC TTC CCA GGC ACC CAG CAT CGA AAT 
AAC CCT GCA TGG GGC AGA TGT ATG AAG GGT CCG TGG GTC GTA CCT TTA 

2120 2130 2140 2150 2160 

* * * * * 

AAA GCA CCC ACC ACT GCC CTG CGC CCC TGT GAG ACT CTG ATG GTT CTT 
TTT CCT GGC TGG TGA CGG GAC CCG GGG ACA CTC TGA CAC TAC CAA GAA 

2170 2180 2190 2200 2210 

* • • * * 

TCC ACG GGT CAG GCC GAG TCT GAG GCC TGA GTG ACA TGA GGG AGG CAG 
AGG TGC CCA CTC CGG CTC AGA CTC CGG ACT CAC TGT ACT CCC TCC GTC 

2220 2230 2240 2250 2260 

* * ♦ * * 

AGC GGG TCC CAC TCT CCC CAC ACT GGC CCA GGC TGT GCA GGT GTG CCT 
TCG CCC AGG GTG ACA GGG GTG TGA CCG GGT CCG ACA CCT CCA CAC GGA 

2270 2280 2290 2300 2310 

♦ • * * * 

GGG CCA CCT AGG GTG GGG CTC AGC CAG GGG CTC CCC TCG GCA GGC TGG 
CCC GGT GGA TCC CAC CCC GAG TCG CTC CCC GAC GGG ACC CCT CCC ACC 

2320 2330 2340 2350 

• • • # 

GGG ATT TGC CAG CCT GGC CCT CCC TCC AGC AGC AGG ACT CTA GAG GAT
CCC TAA ACC GTC GCA CCC GGA GGG AGG TCG TCG TCC TGA GAT CTC CTA 

2360 2370 2380 2390 2400 

• • * # # 

CAT AAT CAG CCA TAC CAC ATT TCT AGA GGT TTT ACT TGC TTT AAA AAA 
CTA TTA CTC GGT ATG CTC TAA ACA TCT CCA AAA TGA ACC AAA TTT TTT 

2410 2420 2430 2440 2450 

♦ * * * # 

CCT CCC ACA CCT CCC CCT GAA CCT GAA ACA TAA AAT GAA TGC AAT TGT 
GGA GGG TCT GGA GGG GGA CTT GGA CTT TCT ATT TTA CTT ACG TTA ACA 
FIG. 4 F

2460 2470 2480 2490 2500

TGT TCT TAA CTT CTT TAT TCC AGC TTA TAA TGG TTA CAA ATA AAG CAA 
ACA ACA ATT GAA CAA ATA ACG TCC AAT ATT ACC AAT CTT TAT TTC CTT 

2510 2520 2530 2540 2550 

» 

TAG CAT CAC AAA TTT CAC AAA TAA ACC ATT ITT TTC ACT CCA TTC TAG 
ATC GTA CTC TTT AAA CTC TTT ATT TCC TAA AAA AAC TGA CCT AAC ATC 

2560 2570 2580 2590

* * * • 

TTC TGG TTT CTC CAA ACT CAT CAA TGT ATC TTA TCA TGT CTG GAT CCT 
AAC ACC AAA CAG GTT TGA GTA CTT ACA TAG AAT ACT ACA GAC CTA GGA 

2600 2610 2620 2630 2640 

* * * # * 

CTA CCC CGG ACG CAT CGT GGC CGG CAT CAC CGC CGC CAC AGG TGC GGT 
GAT GCG GCC TCC GTA GCA CCC CCC GTA GTG GCC CCC GTC TCC ACG CCA 

2650 2660 2670 2680 2690 

* * # * # 

TGC TGG CGC CTA TAT CGC CGA CAT CAC CGA TGG GGA AGA TOG GGC TOG 
ACG ACC GCG GAT ATA GCG GCT GTA GTG GCT ACC CCT TCT AGC CCG AGC 

2700 2710 2720 2730 2740 

* + + + « 

CCA CTT CGG CCT CAT GAG CCC TTC TTT CGG CGT GGG TAT GGT GGC AGG 
GGT GAA GCC CGA GTA CTC CCG AAC AAA GCC GCA CCC ATA CCA CCG TCC 

2750 2760 2770 2780 2790 

♦ * * * « 

CCC GTC CCC GGG GGA CTG TTC GGC GCC ATC TCC TTG CAT GCA CCA TTC 
GGG CAC CGG CCC CCT GAC AAC CCG CGG TAG AGG AAC GTA CCT CGT AAC 

2800 2810 2820 2830 

* * • * 

CTT CCG GCC CCG GTC CTC AAC GGC CTC AAC CTA CTA CTG CGC TGC TTC 
GAA CGC CGC CGC CAC GAG TTC CCG GAG TTG GAT CAT GAC CCG ACC AAG 

2840 2850 2860 2870 2880 

* • ♦ * * 

CTA ATG CAG GAG TCC CAT AAG GGA GAG CCT CGA CCT CGG GCC CCG TTC 
GAT TAC GTC CTC AGC CTA TTC CCT CTC CCA GCT GGA GCC CCC CGC AAC 

2890 2900 2910 2920 2930 

* » * * * 

CTC GCC TTT TTC CAT AGG CTC CGC CCC CCT GAC GAC CAT CAC AAA AAT 
GAC CGC AAA AAG GTA TCC GAG GCC GGG GGA CTC CTC CTA CTC TTT TTA 

2940 2950 2960 2970 2980 

* • • # * 

CGA CGC TCA ACT CAG AGG TGC CGA AAC CCC ACA GGA CTA TAA AGA TAC 
CCT CCC ACT TCA CTC TCC ACC CCT TTC GGC TCT CCT GAT ATT TCT ATG 
FIG. 4 G



2990 3000 3010 3020 3030 

• • * • # 

CAC GCC TTT CCC CCT CCA AGC TCC CTC GTC CCC TCT CCT GTT CCG ACC 
CTC CCC AAA CGC CGA CCT TCC ACC GAG CAC CCC ACA CCA CAA GCC TGG 

3040 3050 3060 3070 

• ♦ • * 

CTG CCC CTT ACC CGA TAC CTG TCC GCC TIT CTC CCT TCC CGA ACC GTC 
CAC GCC GAA TGG CCT ATC GAC AGG CCG AAA CAC CGA ACC CCT TCC CAC 

3080 3090 3100 3110 3120 

• • • * * 

CCG CTT TCT CAA TCC TCA CCC TCT ABC TAT CTC ACT TCC GTC TAG GTC 
CGC GAA AGA CTT ACC ACT CCC ACA TCC ATA GAC TCA ACC CAC ATC CAC 

3130 3140 3150 3160 3170

GTT CGC TCC AAG CTG CGC TGT CTG CAC GAA CCC CCC CTT CAC CCC GAC 
C^GCGAGGTTCGACCCGAC^C^CCTCOTGGGGTO 



3180 3190 3200 3210



+ 



3220 



CCC TGC GCC TTA TCC GGT AAC TAT CCT CTT GAG TCC AAC CCG GTA AGA 
GCC ACG CCG AAT AGG CCA TTG ATA GCA GAA CTC ACC TTC GCC CAT TCT 



3230 3240 3250 3260 



3270 



CAC GAC TTA TCG CCA CTC GCA GCA GCC ACT CCT AAC ACG ATT ACC AGA 
GTG CTG AAT ACC GGT GAC CCT CCT CGC TGA CCA TTC TCC TAA TCC TCT 

3280 3290 3300 3310 

+ ♦ * » 

GCG AGG TAT GTA GGC GGT GCT ACA GAG TTC TTG AAG TGG TGG CCT AAC 
CGC TCC ATA CAT CCC CCA CGA TCT CTC AAC AAC TTC ACC ACC GGA TTG 

3320 3330 3340 3350 3360 

* * • • * 

TAC GGC TAC ACT AGA ACG ACA GTA TTT GGT ATC TCC GCT CTG CTG AAC 
ATC CCC ATC TGA TCT TCC TCT CAT AAA CCA TAC ACC CGA GAC GAC TTC 

3370 3380 3390 3400 3410 

♦ • * * * 

CCA GTT ACC TTC GGA AAA AGA GTT CCT ACC TCT TCA TCC CCC AAA CAA 
GCT CAA TCG AAG CCT TTT TCT CAA CCA TCC ACA ACT ACG CCG TIT GTT 

3420 3430 3440 3450 3460 

* * ♦ # « 

ACC ACC GCT GGT ACC GGT GGT TTT TTT GTT TGC AAG CAG CAC ATT ACG 
TGG TGG CGA CCA TCG CCA CCA AAA AAA CAA ACC TTC CTC CTC TAA TCC 

3470 3480 3490 3500 3510 

• • * * # 

CCC AGA AAA AAA GGA TCT CAA GAA GAT CCT TTC ATC TTT TCT ACC CGG- 
CCC TCT TTT TTT CCT ACA CTT CTT CTA CGA AAC TAG AAA AGA TGC CCC 
FIG. 4 H



3520 3530 3540 3550 



* 



TCT GAC GCT CAC TCC AAC GAA AAC TCA CCT TAA GGG ATT TTG GTC ATG 
ACA CTC CGA CTC ACC TTC CTT TTC ACT GCA ATT CCC TAA AAC CAG TAC 

3560 3570 3580 3590 3600 

ACA TTA TCA AAA ACG ATC TTC ACC TAC ATC CTT TTA AAT TAA AAA TCA 
TCT AAT ACT TTT TCC TAC AAC TCC ATC TAC CAA AAT TTA ATT TIT ACT 

3610 3620 3630 3640 3650 

ACT TTT AAA TCA ATC TAA ACT ATA TAT GAG TAA ACT TCC TCT GAC ACT 
TCA AAA TTT ACT TAC ATT TCA TAT ATA CTC ATT TGA ACC ACA CTC TCA 

3660 3670 3680 3690 3700

TAC CAA TCC TTA ATC ACT GAC CCA CCT ATC TCA CCG ATC TCT CTA TTT 
ATC CTT ACC AAT TAC TCA CTC CCT GGA TAC ACT CCC TAC ACA GAT AAA 



3710 3720 3730 3740 



3750 



CGT TCA TCC ATA GTT CCC TGA CTC CCC CTC CTC TAC ATA ACT ACG ATA 
CCA ACT ACG TAT CAA CGG ACT GAG GGG CAG CAC ATC TAT TGA TCC TAT 

3760 3770 3780 3790 

* * * 

CGG GAG GCC TTA CCA TCT GGC CCC ACT CCT GCA ATC ATA CCG CGA GAC 
GCC CTC CCG AAT GCT ACA CCG GGG TCA CGA CCT TAC TAT GGC CCT CTC 



3800 3810 3820 3830 



3840 



CCA CGC TCA CCG CCT CCA GAT TTA TCA GCA ATA AAC CAG CCA GCC GGA 
GGT GCG AGT GGC CGA GGT CTA AAT ACT CGT TAT TTG GTC CGT CGG CCT 

3850 3860 3870 3880 



3890 



AGG GCC GAC CGC AGA ACT CGT CCT GCA ACT TTA TCC GCC TCC ATC CAG 
TCC CGG CTC CCG TCT TCA CCA GGA CGT TCA AAT AGG CCG ACG TAG GTC 



3900 3910 3920 3930 



* 



♦ 



3940 



TCT ATT AAT TCT TCC CGG GAA GCT AGA GTA AGT AGT TCG CCA GTT AAT 
ACA TAA TTA ACA ACC GCC CTT CGA TCT CAT TCA TCA ACC GGT CAA TTA 



3950 3960 3970 3980



3990 



ACT TTG CCC AAC CTT GTT GCC ACT GCT ACA CGC ATC CTC CTC TCA CGC 
TCA AAC GCG TTG CAA CAA CGG TAA CGA TCT CCG TAG CAC CAC ACT CCC 

4000 4010 4020 4030 

TCC TCG TTT GGT ATC GCT TCA TTC AGC TCC GCT TCC CAA CGA TCA ACC 
AGC ACC AAA CCA TAC CGA ACT AAC TCG ACG CCA AGG GTT CCT ACT TCC 
FIG. 4 I



4040 4050 4060 4070 4080 

* • * • • 

CGA GTT ACA TGA TCC CCC ATC TTC TCC AAA AAA CCC GTT AGC TCC TTC 
OCT CAA TCT ACT ACC CGG TAC AAC ACC TIT TTT CCC CAA TCC ACC AAG 

4090 4100 4110 4120 4130

• • • • # 

CCT CCT CCC ATC CTT CTC ACA ACT AAC TTC CCC CCA CTC TTA TCA CTC 
CCA CGA CCC TAC CAA CAG TCT TCA TTC AAC CCC CCT CAC AAT ACT GAG 

4140 4150 4160 4170 4180 

* # * • » 

ATC CTT ATC CCA CCA CTC CAT AAT TCT CTT ACT CTC ATC CCA TCC CTA 
TAC CAA TAC CCT CCT CAC CTA TTA ACA CAA TCA CAC TAC CCT ACC CAT 

4190 4200 4210 4220 4230 

• • * * * 

ACA TGC TTT TCT GTG ACT GCT GAG TAC TCA ACC AAG TCA TTC TGA GAA 
TCT ACC AAA ACA CAC TCA CCA CTC ATC ACT TCC TTC ACT AAG ACT CTT 

4240 4250 4260 4270 

• • * * 

TAG TCT ATC CCC CCA CCC ACT TCC TCT TCC CCC CCC TCA ACA CCC CAT 
ATC ACA TAC CCC GCT CCC TCA ACQ AGA ACC CCC CCC ACT TCT CCC CTA 

4280 4290 4300 4310 4320 

* * * * * 

AAT ACC GCG CCA CAT AGC AGA ACT TTA AAA GTG CTC ATC ATT CGA AAA 
TTA TGG CGC GGT GTA TCG TCT TGA AAT TTT CAC GAG TAG TAA CCT TTT 

4330 4340 4350 4360 4370 

* * * + * 

CGT TCT TCG GCG CGA AAA CTC TCA AGG ATC TTA CCC CTC TTG AGA TCC 
GCA AGA AGC CCC GCT TTT GAG ACT TCC TAG AAT CGC CAC AAC TCT AGG 

4380 4390 4400 4410 4420 

* * * * * 

ACT TCG ATG TAA CCC ACT CCT GCA CCC AAC TGA TCT TCA GCA TCT TTT 
TCA ACC TAC ATT CGC TGA GCA CGT CCC TTC ACT AGA ACT CCT AGA AAA 

4430 4440 4450 4460 4470 

• * # * # 

ACT TTC ACC AGC CTT TCT GGC TGA GCA AAA ACA CGA AGG CAA AAT GCC 
TGA AAG TGG TCG CAA AGA CCC ACT CCT TTT TCT CCT TCC CTT TTA CCC 

4480 4490 4500 4510 

* * * » 

GCA AAA AAG GGA ATA AGC GCG ACA CGG AAA TCT TGA ATA CTC ATA CTC 
CGT TTT TTC CCT TAT TCC CCC TCT GCC TTT ACA ACT TAT GAG TAT GAG 

4520 4530 4540 4550 4560 

* * * * » 

TTC CTT TTT CAA TAT TAT TGA AGC ATT TAT CAG GCT TAT TCT CTC ATC 
AAG GAA AAA CTT ATA ATA ACT TCC TAA ATA GTC CCA ATA ACA GAG TAC 
FIG. 4 J



4570 4580 4590 4600 4610

* • * * * 

ACC CCA TAC ATA TIT GAA TCT ATT TAG AAA AAT AAA GAA ATA GGC CTT 
TCC CCT ATC TAT AAA CTT ACA TAA ATC TTT TTA TTT CTT TAT CCC CAA 

4620 4630 4640 4650 4660 

* • * • • 

CCC CCC ACA TTT CCC CCA AAA CTC CCA CCT GAC CTC TAA CAA ACC ATT 
CCC CCC TCT AAA CCC CCT TTT CAC CCT CCA CTC CAC ATT CTT TCC TAA 

4670 4680 4690 4700 4710 

* • • • ♦ 

ATT ATC ATC ACA TTA ACC TAT AAA AAT ACC CCT ATC ACC ACC CCC TGA 
TAA TAG TAC TCT AAT TCC ATA TTT TTA TCC CCA TAC TCC TCC CGC ACT 

4720 4730 4740 4750 

* • * • 

TCC CTC TTT CCC CCA CCC ATC CTT CCT AAT CTT CCC TCC CAC CCA CCA 

ACC GAC AAA CCC CCT CCC TAC CAA CCA TTA CAA CCC ACC CTC CCT CCT 

4760 4770 4780 4790 4800 

* • • • • 

CAA CCC TCA AGA CAA AAT CTA ATC ACA CTC CCT CAC CTT CCC CTC CCC 
CTT CGG ACT TCT CTT TTA CAT TAG TCT GAC CCA CTC CAA CCC CAC CCG 

4810 4820 4830 4840 4850 

* * • * * 

CTT TCT CCC TTT ATA ACC ACA CAC TTT ATC TTT AAG AAC GTT GOT AAA 
GAA AGA CCC AAA TAT TCC TCT CTC AAA TAC AAA TTC TTC CAA CCA TTT 

* 

4860 4870 4880 4890 4900

TTC CTT GCG GCT TTG GCA GCC AAG CTA GAC ATC TCT AGC TTC GTG TCA 
AAG GAA CGC CGA AAC CGT CGG TTC GAT CTC TAG AGA TOG AAG CAC ACT 

4910 4920 4930 4940 4950 

ACG ACC GTG ACT GCA GTG AAT AAT AAA ATG TGT GTT TCT CCG AAA TAC 
TCC TGC CAC TGA CCT CAC TTA TTA TTT TAC ACA CAA ACA CGC TTT ATG 

4960 4970 4980 4990 

TT? T2!i 2i? TTC CGC CGIi CTA ' rC * TGT CGC 000 ATA GTG 

CCC AAA ACT CTA AAG ACA GCG GCT GAT TTA ACT ACA GCG CGC TAT CAC 



5000 5010 5020 5030 



5040 



GTG TTT ATC GCC CAT AGA GAT GGC GAT ATT GCA AAA ATC GAT ATT TGA 
CAC AAA TAG CCG CTA TCT CTA CCG CTA TAA CCT TTT TAG CTA TAA ACT 



5050 5060 5070 5080 5090 

* * * • « 

AAA TAT GGC ATA TTG AAA ATG TCG CCG ATC TGA GTT TCT GTG TAA CTC 
TTT ATA CCC TAT AAC TTT TAC AGC GGC TAC ACT CAA AGA CAC ATT GAC 
FIG. 4 K



5100 5110 5120 5130 5140 

* * • * * 

ATA TCC CCA TTT TTC CAA AAC TGA TIT TTG GGC ATA CCC GAT ATC TGG 
TAT ACC CGT AAA AAG CTT TTC ACT AAA AAC CCC TAT CCC CTA TAG ACC 

5150 5160 5170 5180 5190 

* # * * * 

CGA TAG CGC TTA TAT CGT TTA CGG GGG ATC GCG ATA GAC GAC TTT GGT
GCT ATC GCG AAT ATA CCA AAT GCC CCC TAC CGC TAT CTC CTG AAA CCA 

5200 5210 5220 5230 

* • * * 

GAC TTG GGC GAT TCT CTG TCT CCC AAA TAT CGC ACT TTC GAT ATA GGT 
CTG AAC CCC CTA AGA CAC ACA GCG TTT ATA GCG TCA AAC CTA TAT CCA 

5240 5250 5260 5270 5280 

* • # • * 

GAC AGA CGA TAT GAG GCT ATA TCG CCC ATA GAG GCG ACA TCA ACC TGG 
CTG TCT CCT ATA CTC CCA TAT AGC GGC TAT CTC CCC TCT ACT TCC ACC 

5290 5300 5310 5320 5330 

* • • * * 

CAC ATC GCC AAT GCA TAT CGA TCT ATA CAT TGA ATC AAT ATT GGC CAT 
GTG TAC CGG TTA CCT ATA GCT AGA TAT CTA ACT TAG TTA TAA CCG GTA 

5340 5350 5360 5370 5380 

* • * * * 

TAG CCA TAT TAT TCA TTC CTT ATA TAG CAT AAA TCA ATA TTC GCT ATT 
ATC GGT ATA ATA AGT AAC CAA TAT ATC GTA TTT ACT TAT AAC CGA TAA 

5390 5400 5410 5420 5430 

* • * » « 

GGC CAT TGC ATA CGT TGT ATC CAT ATC ATA ATA TCT ACA TTT ATA TTG 
CCG CTA ACC TAT GCA ACA TAG CTA TAG TAT TAT ACA TOT AAA TAT AAC 

5440 5450 5460 5470 

* * * * 

GCT CAT GTC CAA CAT TAC CGC CAT GTT GAC ATT GAT TAT TGA CTA GTT 
CGA GTA GAG CTT GTA ATG GCG GTA CAA CTG TAA CTA ATA ACT GAT CAA 

5480 5490 5500 5510 5520 

* * * * + 

ATT AAT AGT AAT CAA TTA CGG GGT CAT TAG TTC ATA GCC CAT ATA TGG 
TAA TTA TCA TTA GTT AAT GCC CCA GTA ATC AAG TAT CGG GTA TAT ACC 

5530 5540 5550 5560 5570 

* * * * * 

AGT TCC GCG TTA CAT AAC TTA CGG TAA ATG GCC CGC CTG GCT GAC CGC 
TCA AGG CGC AAT GTA TTG AAT GCC ATT TAC CCG GCG GAC CGA CTG GCG 

5580 5590 5600 5610 5620 

* * * * * 
CCA ACG ACC CCC GCC CAT TGA CGT CAA TAA TGA CCT ATC TTC CCA TAG 
GGT TGC TGG GGG CGG CTA ACT GCA GTT ATT ACT GCA TAC AAG GGT ATC 
FIG. 4 L



5630 5640 5650 5660 5670 

* * * + * 

TAX CGC CAA TAG GGA CTT TCC ATT GAC GTC AAT GGG TGG ACT ATT TAC 
ATT GCG GTT ATC CCT GAA ACG TAA CTG CAG TTA CCC ACC TCA TAA ATG 

5680 5690 5700 5710

• • ♦ * 

GGT AAA CTC CCC ACT TGG CAG TAC ATC AAG TGT ATC ATA TGC CAA GTA 
CCA TTT GAC GGG TGA ACC GTC ATC TAG TTC ACA TAG TAT ACG GTT CAT 

5720 5730 5740 5750 5760 

• * • * * 

CGC CCC CTA TTC ACG TCA ATG ACG GTA AAT GGC CCC CCT GGC ATT ATC 
GCG GGG GAT AAC TGC ACT TAC TCC CAT TTA CCC CCC GGA CCG TAA TAC 

5770 5780 5790 5800 5810

• * * * # 

CCC ACT ACA TGA CCT TAT GGG ACT TTC CTA CTT CCC ACT ACA TCT ACC 
GGG TCA TCT ACT GGA ATA CCC TGA AAG GAT GAA CCG TCA TCT AGA TGC 

5820 5830 5840 5850 5860 

* • * • * 

TAT TAG TCA TCC CTA TTA CCA TGG TGA TCC CCT TTT GGC ACT ACA TCA 
ATA ATC ACT AGC GAT AAT GGT ACC ACT ACC CCA AAA CCC TCA TCT ACT 

5870 5880 5890 5900 5910 

* * * • * 
ATG GGC CTC GAT ACC CCT TTC ACT CAC GGG GAT TTC CAA CTC TCC ACC 
TAC CCC CAC CTA TCC CCA AAC TGA GTC CCC CTA AAG GTT CAG ACG TGG 

5920 5930 5940 5950 

• ♦ ♦ 

CCA TTG ACG TCA ATG GGA GTT TGT TTT GGC ACC AAA ATC AAC GGG ACT 
GGT AAC TCC ACT TAC CCT CAA ACA AAA CCG TGG TTT TAG TTG CCC TGA 

5960 5970 5980 5990 6000 

• • • * * 

TTC CAA AAT GTC GTA ACA ACT CCC CCC CAT TGA CGC AAA TGG GCG GTA 
AAG GTT TTA CAG CAT TCT TGA GGC GGG GTA ACT GCG TTT ACC CGC CAT 

6010 6020 6030 6040 6050 

* * * • # 

GGC GTG TAC GGT GGC AGG TCT ATA TAA GCA GAG CTC GTT TAG TGA ACC 
CCG CAC ATC CCA CCC TCC AGA TAT ATT CCT CTC GAG CAA ATC ACT TGG 

6060 6070 6080 6090 6100 

* • ♦ * * 

CTC AGA TCC CCT GGA GAC GCC ATC CAC GCT GTT TTG ACC TCC ATA GAA 
CAG TCT AGC GGA CCT CTC CGG TAG GTG CGA CAA AAC TGG AGG TAT CTT 

6110 6120 6130 6140 6150 

* * * * * 
GAC ACC GGG ACC GAT CCA GCC TCC GCG GCC GGG AAC GGT GCA TTG GAA 
CTG TGG CCC TGG CTA GGT CGG AGG CGC CGG CCC TTG CCA CCT AAC CTT 
FIG. 4 M

6160 6170 6180 6190 

* * • * 

CCC GGA TTC CCC GTC CCA AGA CTG ACQ TAA GTA CCC CCT ATA GAG TCT 

GCG CCT AAG GGG CAC GGT TCT CAC TGC ATT CAT GGC GGA TAT CTC AGA 

6200 6210 6220 6230 6240 

* * * • » 

ATA GGC CCA CCC CCT TGC CTT CTT ATG CAT GCT ATA CTG TTT TTG GCT 
TAT CCC GGT CGG GGA ACC GAA GAA TAC GTA CCA TAT GAC AAA AAC CGA 

6250 6260 6270 6280 6290 

# • • * • 

TCG CCT CTA TAC ACC CCC GCT TCC TCA TCT TAT AGG TGA TGG TAT AGC 
ACC CCA GAT ATG TGG CCC CGA AGG ACT ACA ATA TCC ACT ACC ATA TCG 

6300 6310 6320 6330 6340 

• * * * * 

TTA GCC TAT ACC TCT GGG TTA TTC ACC ATT ATT GAC CAC TCC CCT ATT 
AAT CGG ATA TCC ACA CCC AAT AAC TGC TAA TAA CTC CTC AGC CGA TAA 

6350 6360 6370 6380 6390

* • # * • 

CCT CAC GAT ACT TTC CAT TAC TAA TCC ATA ACA TGC CTC TTT CCC ACA 
CCA CTC CTA TGA AAC GTA ATG ATT AGG TAT TCT ACC GAG AAA CGG TCT 

6400 6410 6420 6430 

* * * * 

ACT CTC TTT ATT GCC TAT ATG CCA ATA CAC TCT CCT TCA GAG ACT GAC 
TCA GAG AAA TAA CCC ATA TAC CCT TAT CTG ACA GGA ACT CTC TGA CTC 

6440 6450 6460 6470 6480 

• • • • * 

ACC GAC TCT CTA TTT TTA CAC GAT GCG CTC TCA TTT ATT ATT TAC AAA 
TCC CTC AGA CAT AAA AAT CTC CTA CCC CAG ACT AAA TAA TAA ATG TTT 

6490 6500 6510 6520 6530 

• * • • • 

TTC ACA TAT ACA ACA CCA CCC TCC CCA GTC CCC CCA CTT TTT ATT AAA 
AAC TCT ATA TCT TCT GGT GGC ACC GGT CAC GCG CCT CAA AAA TAA TTT 

6540 6550 6560 6570 6580 

• • # * * 

CAT AAC CTG GGA TCT CCA CGC GAA TCT CGG CTA CCT CTT CCG GAC ATG 
GTA TTG CAC CCT AGA CCT GCG CTT AGA GCC CAT CCA CAA GGC CTC TAC 

6590 6600 6610 6620 6630 

# * * * » 

GGC TCT TCT CCG GTA GCG GCG GAG CTT CTA CAT CCG AGC CCT CCT CCC 
CCG AGA AGA GGC CAT CCC CCC CTC GAA GAT CTA CGC TCG GGA CCA GGC 

6640 6650 6660 6670 

* * m m 

ATG CCT CCA GCG ACT CAT GCT CGC TCG GCA GCT CCT TGC TCC TAA CAG 
TAC GGA GGT CGC TGA GTA CCA CCC AGC OCT CCA GGA ACG AGG ATT GTC 
FIG. 4 N



6680 6690 6700 6710 6720 

* + • * * 

TCG ACC CCA CAC TTA GGC ACA CCA CGA TGC CCA CCA CCA CCA CTG TCC 
ACC TCC GOT CTG AAT CCC TGT CGT GCT ACG OCT OCT GGT GCT CAC ACG 

6730 6740 6750 6760 6770 

* * * ♦ ♦ 

CCC ACA ACG CCC TCG CCG TAG GGT ATG TGT CTC AAA ATG ACC TCC GGG 
CCG TGT TCC GGC ACC GCC ATC CCA TAG ACA GAC TTT TAC TCG ACC CCC 

6780 6790 6800 6810 6820

ACC GGC CTT CCA CCC CTC ACC CAT TTC GAA GAC TTA ACG CAC CCG CAC 
TCG CCC GAA CGT GGC GAC TGC GTA AAC CTT CTG AAT TCC GTC GCC CTC 

6830 6840 6850 6860 6870 

* • * * * 

AAC AAC ATC CAC CCA GCT GAC TTC TTC TCT TCT GAT AAC ACT CAC ACC 
TTC TTC TAC GTC CGT CGA CTC AAC AAC ACA ACA CTA TTC TCA GTC TCC 

6880 6890 6900 6910 

* * * * 

TAA CTC CCC TTC CCC TCC TCT TAX CCG TCC ACC GCA CTC TAG TCT GAC 
ATT GAC GCC AAC GCC ACC ACA ATT CCC ACC TCC OCT CAC ATC ACA CTC 

6920 6930 6940 6950 6960 

• • * • * 

CAC TAC TCC TTC CTC CCC CCC CCC CCA CCA GAC ATA ATA GCT GAC ACA 
CTC ATC ACC AAC GAC GGC CCC CCC GGT GCT CTC TAT TAT CGA CTG TCT 

6970 6980 6990 7000 7010 

• * * * * 

CTA ACA GAC TCT TCC TTT CCA TCG GTC TTT TCT CCA CTC ACC CTC CTT 
GAT TCT CTC ACA ACC AAA GCT ACC CAC AAA ACA CGT CAC TCC CAC GAA 

7020 7030 7040 7050 7060 

* • * • * 

GAC ACG AAC CTT GGC CTC CAC GTC GAT CGA CTC TAG ACG ATC GAT CCC 
CTC TGC TTC GAA CCC GAC CTC CAC CTA GOT GAC ATC TCC TAC CTA GGG 

7070 

CCG GCC ACC TC 
CCC CCC TCC AC 
FIG. 5 A

The pEe12TF8LCDR3 expression vector DNA sequence. The coding regions of the TF8-5G9 CDR-grafted LC gene, TF8LCDR3, are translated.
Sequence Range: 1 to 7864



10 20 30 40 50 



AAT TCA CC ATG GGT GTC CCA ACT CAG CTA TTA GGA TTA CTG CTC CTG TGG 
TTA ACT GG TAC CCA CAC GGT TGA GTC CAT AAT CCT AAT GAC GAC GAC ACC 
Met Gly Val Pro Thr Gln Val Leu Gly Leu Leu Leu Leu Trp>

60 70 80 90 









* 












* 








* 






CTT 
GAA 
Leu 


ACA 
TGT 
Thr 


GAT 
CTA 
Asp


GCA 
CCT 
Ala 


AGA 
TCT 
Arg 


TCT 
ACA 
Cys


GAT 
CTA 
Asp 


ATC 
TAG 
Ile


CAA 
GTT 
Gln


ATG 
TAC 
Met 


ACA 
TGT 
Thr 


CAA 
GTT 
Gln


TCT 
AGA 
Ser 


CCT 
GGA 
Pro 


TCT 
AGA 
Ser 


TCT 
AGA 
Ser> 


100 
♦ 






110 
* 






120 
* 




130 






140 






CTA 
CAT 
Leu 


ACT 
TCA 
FIG. 5 B

440 450 460 470 



480 



GAG CAG TTG AAA TCT GGA ACT GCC TCT GTT GTG TGC CTG CTG AAT AAC 
CTC GTC AAC TTT AGA CCT TGA CGG AGA CAA CAC ACG GAC GAC TTA TTG 
Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn>

490 500 510 520 530 

♦ * * * * 

TTC TAT CCC AGA GAG GCC AAA GTA CAG TGG AAG GTG GAT AAC GCC CTC 
AAG ATA GGG TCT CTC CGG TTT CAT GTC ACC TTC CAC CTA TTG CGG GAG 
Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu>

540 550 560 570 

* * * * 

CAA TCG GGT AAC TCC CAG GAG AGT GTC ACA GAG CAG GAC AGC AAG GAC 
GTT AGC CCA TTG AGG GTC CTC TCA CAG TGT CTC GTC CTG TCC TTC CTG 
Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp>

580 590 600 610 620

+ * * # * 

AGC ACC TAC AGC CTC AGC AGC ACC CTG ACG CTG AGC AAA GCA GAC TAC 
TCG TCG ATG TCC GAG TCG TCG TGG GAC TGC GAC TCC TTT CCT CTG ATG 
Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr>

630 640 650 660 670 

* * * * * 

GAG AAA CAC AAA GTC TAC GCC TGC GAA GTC ACC CAT CAG GGC CTC AGC 
CTC TTT CTC TTT CAG ATG CGG ACG CTT CAG TGG GTA GTC CCG GAC TCG 
Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser>

680 690 700 710 720 

« * * * * 

TCG CCC CTC ACA AAG AGC TTC AAC AGG GGA GAG TCT T AGA CGG AGA AGT 
AGC GGG CAG TGT TTC TCG AAG TTG TCC CCT CTC ACA A TCT CCC TCT TCA 
Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys>

730 740 750 760 770 

• # * * * 

CCC CCC ACC TCC TCC TCA GTT CCA GCC TGC GGA TCA TAA TCA CCC ATA 
CGG GGG TGG ACG AGG AGT CAA GGT CCG ACC CCT AGT ATT ACT CCG TAT 

780 790 800 810 

• * • * 

CCA CAT TTG TAG AGC TTT TAC TTC CTT TAA AAA ACC TCC CAC ACC TCC 
GGT GTA AAC ATC TCC AAA ATG AAC GAA ATT TTT TGG AGC GTC TCG AGC 

820 830 840 850 860 

* • * • • 

CCC TGA ACC TCA AAC ATA AAA TGA ATC CAA TTC TTC TTC TTA ACT TCT 
GGG ACT TGG ACT TTC TAT TTT ACT TAC GTT AAC AAC AAC AAT TCA ACA 

870 880 890 900 910 

* • * * • 

TTA TTG CAG CTT ATA ATG CTT ACA AAT AAA GCA ATA GCA TCA CAA ATT 
AAT AAC CTC CAA TAT TAC CAA TCT TTA TTT CCT TAT CCT AGT CTT TAA 



WO 96/40921 



PCT/US96/09287




FIG. 5 C 



920 930 940 950 960 

* * • • # 
TCA CAX ATA AAC CAT TTT TIT CAC TCC ATT CTA CTT CTG CTT TGT CCA 
ACT CTT TAT TTC CTA AAA AAA GTC ACC TAA GAT CAA CAC CAA ACA CGT 

970 980 990 1000 1010 

* * * 

AAC TCA TCA ATG TAT CTT ATC ATG TCT GGA TCC TCT ACG CCG GAC GCA 
TTG ACT ACT TAG ATA GAA TAG TAC AGA CCT AGG AGA TGC GGC CTG CGT 

1020 1030 1040 1050 

♦ * ♦ « 

TCG TGG CCG GCA TCA CCG GCG CCA CAG GTG CGG TTG CTG GCG CCT ATA 

AGC ACC GGC CGT AGT GGC CGC GGT GTC CAC GCC AAC GAC CGC GGA TAT 

1060 1070 1080 1090 1100 

* * <* * * 

TCG CCG ACA TCA CCG ATG GGG AAG ATC GGG CTC GCC ACT TCG GGC TCA 
AGC GGC TGT AGT GGC TAC CCC TTC TAG CCC GAG CGG TGA AGC CCG AGT 

1110 1120 1130 1140 1150 

* * * • * 

TGA GCG CTT GTT TCG GCG TGG GTA TGG TGG CAG GCC CGT GGC CGG GGG 
ACT CGC GAA CAA AGC CCC ACC CAT ACC ACC CTC CGG GCA CCG GCC CCC 

1160 1170 1180 1190 1200 

* * * • « 

ACT GTT GGG CGC CAT CTC CTT GCA TGC ACC ATT CCT TGC GGC GGC GGT 
TGA CAA CCC GCG GTA GAG GAA CGT ACG TGG TAA GGA ACG CCG CCG CCA 

1210 1220 1230 1240 1250 

* • * * * 

GCT CAA CGG CCT CAA CCT ACT ACT GGG CTG CTT CCT AAT GCA GGA GTC 
CCA GTT GCC GGA GTT GGA TGA TGA CCC GAC GAA GGA TTA CGT CCT CAG 

1260 1270 1280 1290 

• * * * 

GCA TAA GGG AGA GCG TCG ACC TCC GGC CCC GTT GCT GCC GTT TTT CCA 
CGT ATT CCC TCT CGC AGC TGG AGC CCC GCG CAA CGA CCC CAA AAA GGT 

1300 1310 1320 1330 1340 

• * * * * 

TAG GCT CCG CCC CCC TGA CGA GCA TCA CAA AAA TCG ACC CTC AAG TCA 
ATC CGA GGC GGG GGG ACT GCT CCT AGT GTT TTT AGC TCC GAG TTC AGT 

1350 1360 1370 1380 1390 

* * • • * 

GAC CTG GCC AAA CCC GAC AGG ACT ATA AAG ATA CCA GGC GTT TCC CCC 
CTC CAC CGC TTT GGG CTG TCC TGA TAT TTC TAT GGT CCG CAA AGG GGG 

1400 1410 1420 1430 1440 

• • # * * 

TGC AAC CTC CCT CCT GCG CTC TCC TGT TCC GAC CCT GCC GCT TAC CGC 
ACC TTC GAG GGA GCA CCC GAG AGG ACA AGG CTC GGA CCC CGA ATG GCC 

1450 1460 1470 1480 1490 

* • * * * 

ATA CCT CTC CGC CTT TCT CCC TTC GGC AAG CCT GGC GCT TTC TCA ATG 
TAT GGA CAG GCC GAA AGA GGG AAG CCC TTC CCA CCC CGA AAG AGT TAC 
FIG. 5 D



1500 1510 1520 1530 

* * • * 

CTC ACG CTC TAG GTA TCT CAC TTC GOT CTA GOT OGT TCC CTC CAA GCT 
GAG TGC GAC ATC CAT AGA CTC AAG CCA CAT CCA CCA AGC GAG GTT CGA 

1540 1550 1560 1570 1580 

* * * * * 

GGG CTG TGT GCA CGA ACC CCC CGT TCA GCC CCA CCG CTG CCC CTT ATC 
CCC GAC ACA CGT GCT TGG GGG GCA ACT CGG GCT CGC GAC GCG GAA TAG 

1590 1600 1610 1620 1630 

* * * * + 

CGG TAA CTA TCG TCT TGA GTC CAA CCC GGT AAG ACA CGA CTT ATC GCC 
GCC ATT GAT AGC AGA ACT CAG GTT GGG CCA TTC TGT GCT GAA TAG CGG 

1640 1650 1660 1670 1680 

* * * * * 

ACT GGC AGC AGC CAC TGG TAA CAG GAT TAG CAG AGC GAG GTA TGT AGG 
TGA CCG TCG TCG CTG ACC ATT GTC CTA ATC GTC TCG CTC CAT ACA TCC 

1690 1700 1710 1720 1730 

* * * ♦ * 
CGG TGC TAG AGA GTT CTT GAA GTG GTG GCC TAA CTA CGG CTA CAC TAG 
GCC ACG ATC TCT CAA GAA CTT CAC CAC CGG ATT GAT CCC GAT GTG ATC 

1740 1750 1760 1770 

* * ♦ * 

AAG GAC ACT ATT TCG TAT CTG CGC TCT GCT GAA CCC ACT TAC CTT CCG 
TTC CTG TCA TAA ACC ATA GAC GCG AGA CGA CTT CGG TCA ATG GAA GCC 

1780 1790 1800 1810 1820 

* ♦ ♦ • * 

AAA AAG AGT TGG TAG CTC TTG ATC CGG CAA ACA AAC CAC CGC TGG TAG 
TTT TTC TCA ACC ATC GAG AAC TAG GCC GTT TGT TTG GTC CCC ACC ATC 

1830 1840 1850 1860 1870 

* • * * * 

CCG TCC TTT TIT TCT TTG CAA CCA GCA GAT TAC CCG CAG AAA AAA AGG 
CCC ACC AAA AAA ACA AAC CTT CGT CGT CTA ATG CGC CTC TTT TTT TCC 

1880 1890 1900 1910 1920 

• * • * * 

ATC TCA AGA AGA TCC TTT GAT CTT TTC TAC CCG CTC TGA CGC TCA GTG 
TAG AGT TCT TCT AGG AAA CTA GAA AAG ATG CCC CAG ACT GCG AGT CAC 

1930 1940 1950 1960 1970

♦ • * * * 

GAA CGA AAA CTC ACC TTA ACG GAT TTT GCT CAT GAG ATT ATC AAA AAG 
CTT GCT TTT GAG TGC AAT TCC CTA AAA CCA GTA CTC TAA TAG TTT TTC 

1980 1990 2000 2010

* • * * 

GAT CTT CAC CTA GAT CCT TTT AAA TTA AAA ATG AAG TTT TAA ATC AAT 
CTA GAA GTC GAT CTA CGA AAA TTT AAT TTT TAC TTC AAA ATT TAG TTA 

2020 2030 2040 2050 2060 

* ♦ » * « 

CTA AAG TAT ATA TCA CTA AAC TTC CTC TGA CAG TTA CCA ATG CTT AAT 
GAT TTC ATA TAT ACT CAT TTC AAC CAC ACT GTC AAT GCT TAC GAA TTA 
FIG. 5 E



2070 2080 2090 2100 2110 

* * * * * 

CAG TGA GGC ACC TAT CTC AGC GAT CTG TCT ATT TCG TTC ATC CAT ACT 
GTC ACT CCG TGG ATA GAG TCG CTA GAC AGA TAA AGC AAG TAG GTA TCA 

2120 2130 2140 2150 2160 

* * + • • 

TGC CTG ACT CCC CGT CGT GTA GAT AAC TAC GAT ACG GGA GGG CTT ACC 
ACG GAC TGA GGG GCA GCA CAT CTA TTG ATG CTA TGC CCT CCC GAA TGG 

2170 2180 2190 2200 2210 

* * * * *

ATC TGG CCC CAG TGC TGC AAT GAT ACC CCG AGA CCC ACC CTC ACC GGC 
TAG ACC GGG GTC ACG ACG TTA CTA TGG CGC TCT GGG TGC GAG TGG CCG 

2220 2230 2240 2250 

* * * *

TCC AGA TTT ATC AGC AAT AAA CCA GCC AGC CCG AAG GGC CGA GCG CAG 
AGG TCT AAA TAG TCG TTA TTT GGT CGC TCG GCC TTC CCC GCT CGC GTC 

2260 2270 2280 2290 2300 

* * * * *

AAG TGG TCC TGC AAC TTT ATC CGC CTC CAT CCA GTC TAT TAA TTG TTG 
TTC ACC AGG ACG TTG AAA TAG GCC GAG GTA GGT CAG ATA ATT AAC AAC 

2310 2320 2330 2340 2350 

* * * * *

CCG GGA AGC TAG ACT AAG TAG TTC GCC ACT TAA TAG TTT GCG CAA CGT 
GGC CCT TCC ATC TCA TTC ATC AAG CGC TCA ATT ATC AAA CCC GTT GCA 

2360 2370 2380 2390 2400 



* 



TCT TGC CAT TCC TAC AGG CAT CGT GGT GTC ACC CTC CTC GTT TCG TAT 
ACA ACC CTA ACC ATG TCC GTA GCA CCA CAG TCC GAG CAG CAA ACC ATA 

2410 2420 2430 2440 2450 

* * * * *

GCC TTC ATT CAG CTC CGG TTC CCA ACC ATC AAG GCG ACT TAC ATG ATC 
CCG AAG TAA CTC GAG GCC AAG GCT TCC TAG TTC CCC TCA ATC TAC TAG 

2460 2470 2480 2490 

* * * *

CCC CAT GTT GTG CAA AAA AGC CCT TAG CTC CTT CCC TCC TCC GAT CCT 
CGG CTA CAA CAC GTT TTT TCC CCA ATC GAG GAA GCC AGG AGG CTA GCA 

2500 2510 2520 2530 2540 

* * * * *

TCT CAG AAC TAA CTT GGC CCC ACT CTT ATC ACT CAT GGT TAT GCC AGC 
ACA GTC TTC ATT CAA CCC GCG TCA CAA TAG TGA GTA CCA ATA CCG TCG 

2550 2560 2570 2580 2590 

* * * * * 

ACT CCA TAA TTC TCT TAC TCT CAT GCC ATC CCT AAC ATC CTT TTC TCT 
TGA CCT ATT AAG AGA ATG ACA GTA CGG TAG GCA TTC TAC GAA AAG ACA 

2600 2610 2620 2630 2640 

* * * * *

GAC TGG TGA GTA CTC AAC CAA GTC ATT CTG AGA ATA CTG TAT GCG GCG 
CTG ACC ACT CAT GAC TTC CTT CAG TAA GAC TCT TAT CAC ATA CGC CGC 
FIG. 5 F

2650 2660 2670 2680 2690

* * * * *
ACC GAG TTG CTC TTG CCC GGC GTC AAC ACG GGA TAA TAC CGC GCC ACA 
TGG CTC AAC GAG AAC GGG CCG CAG TTG TGC CCT ATT ATG GCC CGG TGT 

2700 2710 2720 2730 

* * * * 

TAG CAG AAC TTT AAA ACT GCT CAT CAT TGG AAA ACG TTC TTC GGG GCG 
ATC GTC TTG AAA TTT TCA CGA GTA GTA ACC TTT TGC AAG AAG CCC CGC 

2740 2750 2760 2770 2780 

* * * * *

AAA ACT CTC AAG GAT CTT ACC GCT GTT GAG ATC CAG TTC GAT GTA ACC 
TTT TGA GAG TTC CTA GAA TGG CGA CAA CTC TAG GTC AAG CTA CAT TGG 

2790 2800 2810 2820 2830 

* * * * * 

CAC TCG TGC ACC CAA CTC ATC TTC AGC ATC TTT TAC TTT CAC CAG CCT
GTG AGC ACG TGG CTT OAC TAG AAG TCG TAG AAA ATC AAA GTC GTC GCA 

2840 2850 2860 2870 2880

* * * * *
TTC TGG CTC AGC AAA AAC AGG AAG GCA AAA TGC CCC AAA AAA GGG AAT 
AAG ACC CAC TCG TTT TTG TCC TTC CGT TTT ACG GCC TTT TTT CCC TTA 

2890 2900 2910 2920 2930 

* * * * *

AAG GGC GAC ACC GAA ATG TTG AAT ACT CAT ACT CTT CCT TTT TCA ATA 
TTC CCG CTG TCC CTT TAC AAC TTA TGA GTA TGA GAA GGA AAA AGT TAT 

2940 2950 2960 2970 

* * * * 

TTA TTG AAG CAT TTA TCA GGG TTA TTG TCT CAT GAG CGG ATA CAT ATT 
AAT AAC TTC GTA AAT AGT CCC AAT AAC AGA CTA CTC CCC TAT GTA TAA 

2980 2990 3000 3010 3020

* * * * *

TGA ATG TAT TTA GAA AAA TAA ACA AAT AGG GCT TCC GCG CAC ATT TCC 
ACT TAC ATA AAT CTT TTT ATT TCT TTA TCC CCA AGG CGC GTG TAA AGG 

3030 3040 3050 3060 3070 

* * * * *

CCC AAA AGT GCC ACC TGA CCT CTA AGA AAC CAT TAT TAT CAT GAC ATT 
GGC TTT TCA CGG TGG ACT GCA GAT TCT TTC GTA ATA ATA GTA CTG TAA 

3080 3090 3100 3110 3120 

* * * * *

AAC CTA TAA AAA TAG GCC TAT CAC GAG GCC CTG ATC GCT CTT TGC GGC 
TTC GAT ATT TTT ATC CCC ATA GTG CTC CGG GAC TAC CGA GAA ACG CCG 

3130 3140 3150 3160 3170 

* * * * *

ACC CAT CCT TCG TAA TCT TCC CTC GCA CCC AGG ACA ACC CTC AAG AGA 
TCG GTA GCA AGC ATT ACA AGG CAC CCT GGC TCC TGT TGG GAG TTC TCT 

3180 3190 3200 3210 

* * * * 

AAA TCT AAT CAC ACT GGC TCA CCT TCG GCT GGG CCT TTC TCC GTT TAT 
TTT ACA TTA CTG TGA CCG AGT GGA AGC CCA CCC GGA AAG ACG CAA ATA 
FIG. 5 G
3220 3230 3240 3250 3260

* * * * *

AAG GAG ACA CTT TAT GTT TAA GAA GGT TGG TAA ATT CCT TGC GGC TTT 
TTC CTC TGT GAA ATA CAA ATT CTT CCA ACC ATT TAA GGA AOC COG AAA 

3270 3280 3290 3300 3310 

* * * * * 

GGC AGC CAA GCT AGA GAT CCG GCT GTG GAA TGT GTG TCA GTT AGG GTG 
CCG TCG GTT CGA TCT CTA GGC CGA CAC CTT ACA CAC ACT CAA TCC CAC 

3320 3330 3340 3350 3360 

* * * * *

TGG AAA GTC CCC AGG CTC CCC AGC AGG CAG AAG TAT GCA AAG CAT GCA 
ACC TTT CAG GGG TCC GAG GGG TCC TCC GTC TTC ATA CCT TTC CTA CGT 

3370 3380 3390 3400 3410 

* * * * * 
TCT CAA TTA GTC AGC AAC CAG GCT CCC CAG CAG GCA GAA CTA TGC AAA 
AGA GTT AAT CAG TCG TTG GTC CGA GGG GTC GTC CGT CTT CAT AGG TTT 

3420 3430 3440 3450 

* * * * 

CCA TCC ATC TCA ATT ACT CAG CAA CCA TAG TCC CGC CCC TAA CTC CCC 
CCT ACC TAG ACT TAA TCA GTC GTT GGT ATC AGG GCG GGG ATT GAG GCG 

3460 3470 3480 3490 3500 

* * * * *

CCA TCC CGC CCC TAA CTC CGC CCA CTT CCG CCC ATT CTC CGC CCC ATG 
GGT AGG GCG GGG ATT GAG GCG GGT CAA GGC GGG TAA GAG GCG GGG TAC 

3510 3520 3530 3540 3550 

* * * * *

GCT GAC TAA TTT TTT TTA TTT ATC CAG AGG CCG AGG CCG CCT CGG CCT 
CGA CTG ATT AAA AAA AAT AAA TAC CTC TCC GGC TCC GGC GGA GCC GGA 

3560 3570 3580 3590 3600 

* * * * *
CTC ACC TAT TCC AGA ACT ACT GAG GAG GCT TTT TTG GAG GCC TAG GCT 
CAC TCC ATA AGG TCT TCA TCA CTC CTC CGA AAA AAC CTC CCG ATC CGA 

3610 3620 3630 3640 3650 

* * * * *
TTT CCA AAA AGC TAG CTT GGG GCC ACC GCT CAG AGC ACC TTC CAC CAT 
AAA CCT TTT TCG ATC GAA CCC CGG TGC CGA GTC TCG TGG AAG GTG GTA 

3660 3670 3680 3690 

* * * *

GGC CAC CTC AGC AAG TTC CCA CTT GAA CAA AAA CAT CAA GCA AAT CTA 
CCG GTG GAG TCG TTC AAG GCT GAA CTT GTT TTT CTA GTT CCT TTA CAT 

3700 3710 3720 3730 3740 

* * * * *

CTT CTC CCT CCC CCA GGG TCA GAA ACT CCA AGC CAT CTA TAT CTG GGT 
GAA CAC GGA CGG CCT CCC ACT CTT TCA GGT TCC GTA CAT ATA GAC CCA 

3750 3760 3770 3780 3790 

* * * * *

TCA TGG TAC TCG AGA AGG ACT GCG CTC CAA AAC CCC CAC CCT GGA CTG 
ACT ACC ATG ACC TCT TCC TCA CCC GAC GTT TTG GGC GTG GGA CCT GAC 
FIG. 5 H



3800 3810 3820 3830 3840 

* * * * *

TGA GCC CAA CTC TGT AGA AGA GTT ACC TGA CTC GAA TTT TGA TGG CTC 
ACT CCG GTT CAC ACA TCT TCT CAA TGG ACT CAC CTT AAA ACT ACC GAG 

3850 3860 3870 3880 3890 

* * * * *

TAG TAC CTT TCA GTC TGA GGG CTC CAA CAC TGA CAT GTA TCT CAC CCC 
ATC ATG GAA ACT CAG ACT CCC GAG GTT GTC ACT GTA CAT AGA GTC GGG 

3900 3910 3920 3930 

* * * * 

TGT TCC CAT CTT TCC GGA CCC CTT CCG CAG AGA TCC CAA CAA GCT GGT 
ACA ACG GTA CAA AGC CCT GGG GAA GCC CTC TCT AGG GTT GTT CCA CCA 

3940 3950 3960 3970 3980 

* * * * *

GTT CTC TGA ACT TTT CAA GTA CAA CCG GAA GCC TCC ACA GAC CAA TTT 
CAA GAC ACT TCA AAA CTT CAT GTT GCC CTT CCG ACC TCT CTC CTT AAA 

3990 4000 4010 4020 4030 

* * * * *

AAG CCA CTC GTG TAA ACC GAT AAT GGA CAT GGT GAG CAA CCA CCA CCC 
TTC CGT GAG CAC ATT TGC CTA TTA CCT GTA CCA CTC GTT GGT CCT GGG 

4040 4050 4060 4070 4080 

* * * * * 

CTC GTT TCC AAT GGA ACA GGA GTA TAC TCT GAT GGG AAC AGA TGC GCA 
GAC CAA ACC TTA CCT TCT CCT CAT ATG AGA CTA CCC TTG TCT ACC CGT 

4090 4100 4110 4120 4130 

* * * * *

CCC TTT TCC TTC CCC TTC CAA TCC CTT TCC TCC CCC CCA ACC TCC CTA 
CCC AAA ACC AAC CCC AAG GTT ACC GAA ACC ACC CCC CGT TCC ACC CAT 

4140 4150 4160 4170 

* * * *

TTA CTC TCC TCT CCG CCC AGA CAA ACC CTA TCC CAC CCA TAT CCT CCA 
AAT GAC ACC ACA CCC CCC TCT CTT TCC GAT ACC CTC CCT ATA CCA CCT 

4180 4190 4200 4210 4220 

* * * * *

GCC TCA CTA CCC CCC CTC CTT CTA TCC TCC CCT CAA GAT TAC ACC AAC 
CCG ACT CAT CGC CCC GAC GAA CAT ACC ACC CCA GTT CTA ATC TCC TTC 

4230 4240 4250 4260 4270 

* * * * *

AAA TGC TGA CCT CAT CCC TCC CCA CTC CCA ACT CCA AAT ACC ACC CTC 
TTT ACC ACT CCA CTA CCC ACC CCT CAC CCT TCA CCT TTA TCC TCC GAC 

4280 4290 4300 4310 4320 

* * * * *

TCA ACC AAT CCC CAT CCC AGA TCA TCT CTC CCT CCC CCC TTT CAT CTT 
ACT TCC TTA CCC CTA CCC TCT ACT AGA GAC CCA CCC GCC AAA GTA GAA 

4330 4340 4350 4360 4370

* * * * *

NCA TCC ACT ATC TGA AGA CTT TGG GGT AAT AGC AAC CTT TGA CCC CAA 
NGT AGC TCA TAC ACT TCT CAA ACC CCA TTA TCC TTG GAA ACT CCG GTT 
FIG. 5 I

4380 4390 4400 



4410 



^^!^T^^^ G7ATGGTCCAGGCTGCCA TAC CAA CTT TAG 
CGG GTA AGG ACC CTT GAC CTT ACC ACG TCC GAC OCT ATG GTT CAA ATC 



4420 4430 4440 4450 

* 



4460 



CAC CAA GGC CAT GCG GGA GGA GAA TGG TCT GAA GCA CAT CCA GCA GGC 
GTG GTT CCG GTA CGC CCT CCT CTT ACC AGA CTT OCT GTA GCT CCT CCG 

4470 4480 4490 4500 



4510 



CAT GGA GAA ACT AAG CAA GCG GCA CCG GTA CCA CAT TCG AGC CTA CXX 
GTA GCT CTT TGA TTC GTT CGC CGT GGC CAT OCT GTA AGC TCG GAT GCT 



4520 4530 4540 4550 



4560 



* 



TCC CAA GCG CGG CCT GGA CAA TGC CCG TCC TCT GAC TCC GTT CCA CBX 
AGG GTT CCC CCC GGA CCT GTT ACG GGC ACC AGA CPG ACC CAA GCT TCT 

4570 4580 4590 4600 4610

* * * * 

^^^ aT ^^ OTTO ^^ OT ^CAATCX3CAG^ 
TTC CAC CTT GTA GTT GCT GAA AAG ACG ACC ACA GCG GTT AGC GTC ACG 

4620 4630 4640 4650 

* * * *

CAG CAT CCG CAT TCC CCC GAC TCT CGC CCA GGA GAA GAA AGG TTA CTT 
GTC GTA GGC GTA AGC GGC CTG ACA GCC GCT CCT CTT CTT TCC AAT GAA 

4660 4670 4680 4690 4700



♦ 



TGA ACA CCG CGC CCC CTC TCC CAA TTC TGA CCC CTT TGC AGT GAC AGA 
ACT TCT GGC GCC GCG CAG ACG GTT AAC ACT GGC GAA ACC TCA CTC TCT 

4710 4720 4730 4740 4750 

* * *

ACC CAT CCT CCC CAC ATC CCT TCT CAA TCA GAC TCG CCA CGA GCC CTT 

TCC GTA GCA GCC CTC TAC CGA AGA GTT ACT CTC ACC CGT CCT CCG GAA 

4760 4770 4780 4790 4800 

CCA ATA CAA AAA CTA ATT AGA CTT TCA GTC ATC TTC ACC CTT TCC TAC 
CGT TAT CTT TTT CAT TAA TCT GAA ACT CAC TAC AAC TCC CAA AGG ATC 

4810 4820 4830 4840 4850 

TTC ATC CCA CCC CCC CCC ACA GAG ATC TTT GTC AAC GAA CCT TAC TTC 
AAG TAC CGT CCC CCC CGG TCT CTC TAG AAA CAC TTC CTT CGA ATC AAG 

4860 4870 4880 4890 

* * * *

TCT CCT CTC ACA TAA TTC GAC AAA CTA CCT ACA GAC ATT TAA ACC TCT 

ACA CCA CAC TCT ATT AAC CTC TTT CAT GGA TCT CTC TAA ATT TCG ACA 

4900 4910 4920 4930 4940 

* * *

AAC GTA AAT ATA AAA TTT TTA ACT CTA TAA TCT CTT AAA CTA CTC ATT 
TTC CAT TTA TAT TTT AAA AAT TCA CAT ATT ACA CAA TTT GAT GAC TAA 
4950 4960 4970 4980 4990

* * * * *

CTA ATT GTT TGT GTA TTT TAG ATT CCA ACC TAT GGA ACT GAT GAA TGG
GAT TAA CAA ACA CAT AAA ATC TAA GGT TGG ATA CCT TGA CTA CTT ACC 

5000 5010 5020 5030 5040 

* * * * *

GAG CAG TGG TGG AAT GCC TTT AAT GAG GAA AAC CTG TTT TGC TCA GAA 
CTC GTC ACC ACC TTA CGG AAA TTA CTC CTT TTG GAC AAA ACG ACT CTT 

5050 5060 5070 5080 5090 

* * * * *

GAA ATG CCA TCT ACT GAT GAT GAG CCT ACT GCT GAC TCT CAA CAT TCT
CTT TAC GCT AGA TCA CTA CTA CTC CGA TGA CCA CTG AGA CTT GTA AGA 

5100 5110 5120 5130 

* * * *

ACT CCT CCA AAA AAG AAG AGA AAG GTA GAA GAC CCC AAG GAC TTT CCT 
TGA GGA GGT TTT TTC TTC TCT TTC CAT CTT CTG GGG TTC CTG AAA GGA 

5140 5150 5160 5170 5180 

* * * * *

TCA GAA TTG CTA ACT TTT TTG ACT CAT GCT GTC TTT ACT AAT AGA ACT 
ACT CTT AAC GAT TCA AAA AAC TCA GTA CGA CAC AAA TCA TTA TCT TGA 

5190 5200 5210 5220 5230 

* * * * *

CTT GCT TGC TTT GCT ATT TAC ACC ACA AAG GAA AAA GCT CCA CTG CTA 
GAA CGA ACG AAA CGA TAA ATG TGG TGT TTC CTT TTT CGA CCT GAC GAT 

5240 5250 5260 5270 5280 

* * * * *
TAC AAG AAA ATT ATC GAA AAA TAT TCT GTA ACC TTT ATA ACT AGC CAT 
ATG TTC TTT TAA TAC CTT TTT ATA AGA CAT TGG AAA TAT TCA TCC GTA 

5290 5300 5310 5320 5330 

* * * * *

AAC ACT TAT AAT CAT AAC ATA CTC TTT TTT CTT ACT CCA CAC AGG CAT 
TTG TCA ATA TTA GTA TTG TAT GAC AAA AAA GAA TGA GGT CTC TCC GTA 

5340 5350 5360 5370 

* * * *

AGA GTC TCT GCT ATT AAT AAC TAT GCT CAA AAA TTG TGT ACC TTT AGC 
TCT CAC AGA CGA TAA TTA TTG ATA CGA GTT TTT AAC ACA TGG AAA TCC 

5380 5390 5400 5410 5420 

* * * * *

TTT TTA ATT TGT AAA GGG GTT AAT AAG GAA TAT TTG ATG TAT ACT GCC 
AAA AAT TAA ACA TTT CCC CAA TTA TTC CTT ATA AAC TAC ATA TCA CGG 

5430 5440 5450 5460 5470 

* * * * *

TTG ACT AGA GAT CAT AAT CAG CCA TAC CAC ATT TGT ACA GGT TTT ACT
AAC TGA TCT CTA GTA TTA GTC GGT ATG GTC TAA ACA TCT CCA AAA TGA 

5480 5490 5500 5510 5520 

* * * * *

TGC TTT AAA AAA CCT CCC ACA CCT CCC CCT GAA CCT GAA ACA TAA AAT
ACG AAA TTT TTT GGA GGG TCT GGA GGG GGA CTT GGA CTT TGT ATT TTA
5530 5540 5550 5560 5570

* * * * * 

GAA TCC AAT TCT TGT TGT TAA CTT CTT TAT TGC AGC TTA TAA TGG TTA 
CTT ACG TTA ACA ACA ACA ATT GAA CAA ATA ACG TCC AAT ATT ACC AAT 

5580 5590 5600 5610 

* * * *

CAA ATA AAG CAA TAG CAT CAC AAA TTT CAC AAA TAA AGC ATT TTT TTC 
GTT TAT TTC GTT ATC GTA GTG TTT AAA GTG TTT ATT TCG TAA AAA AAG

5620 5630 5640 5650 5660 

* * * * *

ACT GCA TTC TAG TTC TGG TTT GTC CAA ACT CAT CAA TGT ATC TTA TCA 
TGA CCT AAG ATC AAC ACC AAA CAG GTT TGA GTA GTT ACA TAG AAT ACT 

5670 5680 5690 5700 5710 

* * * * *

TGT CTG GAT CTC TAG CTT CGT GTC AAG GAC GGT GAC TCC ACT GAA TAA 
ACA GAC CTA GAG ATC GAA GCA CAG TTC CTG CCA CTG ACG TCA CTT ATT 

5720 5730 5740 5750 5760 

* * * * * 

TAA AAT GTG TGT TTG TCC GAA ATA CGC GTT TTG AGA TTT CTG TCG CCG
ATT TTA CAC ACA AAC AGG CTT TAT GCG CAA AAC TCT AAA GAC AGC GGC 

5770 5780 5790 5800 5810 

* * * * * 
ACT AAA TTC ATG TCC CGC GAT ACT GGT CTT TAT CGC CCA TAG AGA TGG 
TGA TTT AAC TAC AGC GCG CTA TCA CCA CAA ATA CCG CCT ATC TCT ACC 

5820 5830 5840 5850 

* * * *

CGA TAT TGG AAA AAT CCA TAT TTG AAA ATA TCG CAT ATT GAA AAT GTC 
GCT ATA ACC TTT TTA GCT ATA AAC TTT TAT ACC GTA TAA CTT TTA CAG 

5860 5870 5880 5890 5900

* * * * *

GCC GAT CTC ACT TTC TCT GTA ACT CAT ATC CCC ATT TTT CCA AAA GTG 
CCG CTA CAC TCA AAC ACA CAT TGA CTA TAC CGC TAA AAA CGT TTT CAC 

5910 5920 5930 5940 5950

* * * * * 

ATT TTT CGG CAT ACG CGA TAT CTC GCG ATA GCG CTT ATA TCC TTT ACC 
TAA AAA CCC GTA TGC GCT ATA GAC CCC TAT CCC GAA TAT AGC AAA TCC 

5960 5970 5980 5990 6000 

* * * * *

GGG CAT GGC GAT AGA CCA CTT TGC TGA CTT GGG CGA TTC TGT GTC TCC 
CCC CTA CCC CTA TCT GCT GAA ACC ACT GAA CCC GCT AAC ACA CAC AGC 

6010 6020 6030 6040 6050 

* * * * * 

CAA ATA TCC CAG TTT CCA TAT AGG TGA CAG ACC ATA TCA GGC TAT ATC 
GTT TAT ACC CTC AAA CCT ATA TCC ACT CTC TCC TAT ACT CCC ATA TAG 

6060 6070 6080 6090 

* * * *

CCC GAT AGA CCC CAC ATC AAG CTG CCA CAT CGC CAA TCC ATA TCC ATC 
CGG CTA TCT CCC CTC TAG TTC GAC CCT CTA CCC GTT ACC TAT AGC TAG 
6100 6110 6120 6130 6140

TAT ACA TTG AAT CAA TAT TGG CCA TTA GCC ATA TTA TTC ATT GGT TAT 
ATA TGT AAC TTA CTT ATA ACC GGT AAT CCG TAT AAT AAG TAA CCA ATA 

6150 6160 6170 6180 6190 

* * * * *

ATA GCA TAA ATC AAT ATT GGC TAT TGG CCA TTC CAT ACG TTG TAT CCA 
TAT CGT ATT TAG TTA TAA CCG ATA ACC GGT AAC GTA TCC AAC ATA GGT 

6200 6210 6220 6230 6240 

* * * * *

TAT CAT AAT ATG TAG AOT TAT ATT GGC TCA TGT CCA ACA TTA CCG CCA 
ATA GTA TTA TAC ATG TAA ATA TAA CCG ACT ACA GGT TGT AAT GGC GGT 

6250 6260 6270 6280 6290

* * * * *

1^1 T£ ™ ™ TOA TO ACT *" TAT TAA TAG TAA TCA ATT ACG GGG 
ACA ACT GTA ACT AAT AAC TGA TCA ATA ATT ATC ATT ACT TAA TCC CCC 

6300 6310 6320 6330 

* * * *

IS 2S ST ST CCA TAT ATG GAG TTC CCC CTT ACA TAA CTT ACG 

ACT AAT CAA GTA TCC GGT ATA TAC CTC AAC GCG CAA TGT ATT GAA TCC 

6340 6350 6360 6370 6380 

* * * *

ST« CCC CCC CCA TTG ACG 

CAT TTA CCG GGC GGA CCG ACT GGC GGG TTG CTC GGG GCG GGT AAC TTC 

6390 6400 6410 6420 6430 

* * * * *

TCA ATA ATG ACG TAT CTT CCC ATA GTA ACG CCA ATA GCG ACT TTC CAT 
ACT TAT TAC TGC ATA CAA GCG TAT CAT TCC GGT TAT CCC TGA AAG GTA 

6440 6450 6460 6470 6480 

* * * * *

TCA CCT CAA TGG CTC GAG TAT TTA CCG TAA ACT GCC CAC TTC CCA GTA 
ACT CCA CTT ACC CAC CTC ATA AAT CCC ATT TGA CCG CTC AAC CCT CAT 

6490 6500 6510 6520 6530 

* * * * *

CAT CAA CTC TAT CAT ATC CCA ACT ACC CCC CCT ATT GAC CTC AAT CAC 
CTA CTT CAC ATA GTA TAC CCT TCA TCC CCG CCA TAA CTC CAC TTA CTC 

6540 6550 6560 6570 

* * * *

CCT AAA TCC CCC CCC TCC CAT TAT GCC CAC TAC ATC ACC TTA TGG GAC 
CCA TTT ACC GGC CCC ACC CTA ATA CCG CTC ATG TAC TGC AAT ACC CTC 

6580 6590 6600 6610 6620 

* * * * *

TTT CCT ACT TGG CAG TAC ATC TAC GTA TTA CTC ATC CCT ATT ACC ATC 
AAA GCA TGA ACC CTC ATC TAC ATC CAT AAT CAC TAG CCA TAA TGG TAC 

6630 6640 6650 6660 6670 

* * * * *

CTC ATC CCC TTT TCC CAC TAC ATC AAT CCG CGT GGA TAG CCG TTT GAC 
CAC TAC CCC AAA ACC CTC ATG TAG TTA CCC GCA CCT ATC GCC AAA CTC 

FIG. 5 M 



6680 6690 6700 6710 6720

* * * * *

TCA CCG CCA TTT CCA AGT CTC CAC CCC ATT GAC GTC AAT CCC ACT TTG
ACT GCC CCT AAA CGT TCA GAG GTC GGG TAA CTG CAC TTA CCC TCA AAC 

6730 6740 6750 6760 6770 

* * * * *

TTT TGG CAC CAA AAT CAA CGG GAC TTT CCA AAA TGT CGT AAC AAC TCC 
AAA ACC GTG GTT TTA GTT GCC CTG AAA GGT TTT ACA GCA TTG TTG AGG 

6780 6790 6800 6810 

* * * *

GCC CCA TTG ACG CAA ATG GGC GGT AGG CGT GTA CGG TGG GAG GTC TAT 
CGG GGT AAC TCC GTT TAC CCG CCA TCC GCA CAT GCC ACC CTC CAG ATA 

6820 6830 6840 6850 6860 

* * * * * 

ATA AGC ACA CCT CGT TTA GTC AAC CGT CAG ATC GCC TGG AGA CGC CAT 
TAT TCC TCT CCA GCA AAT CAC TTG GCA GTC TAG CCG ACC TCT GCC GTA 

6870 6880 6890 6900 6910 

* * * * *

CCA CCC TCT TTT GAC CTC CAT AGA AGA CAC CGG GAC CGA TCC AGC CTC 
GGT GCC ACA AAA CTG GAG GTA TCT TCT CTG GCC CTG GCT AGG TCG GAG 

6920 6930 6940 6950 6960 

* * * * * 

CGC GGC CCG GAA CGG TCC ATT GGA ACG CCG ATT CCC CCT GCC AAG AGT 

GCC CCG GCC CTT GCC ACC TAA CCT TCC CCC TAA GGG GCA CGG TTC TCA 

6970 6980 6990 7000 7010

* * * * *

GAC GTA AGT ACC GCC TAT AGA GTC TAT AGG CCC ACC CCC TTG CCT TCT 
CTC CAT TCA TGG CCC ATA TCT CAG ATA TCC GGC TCG GGG AAC CCA AGA 

7020 7030 7040 7050 

* * * *

TAT CCA TCC TAT ACT CTT TTT CCC TTC CGG TCT ATA CAC CCC CCC TTC 
ATA CCT ACC ATA TGA CAA AAA CCC AAC CCC AGA TAT GTG GGG GCC AAG 

7060 7070 7080 7090 7100 

* * * * *

CTC ATC TTA TAG GTC ATC GTA TAG CTT AGC CTA TAC CTC TCC CTT ATT 
GAC TAC AAT ATC CAC TAC CAT ATC CAA TCG GAT ATC CAC ACC CAA TAA 

7110 7120 7130 7140 7150 

* * * * *

GAC CAT TAT TCA CCA CTC CCC TAT TCC TCA CCA TAC TTT CCA TTA CTA 
CTC CTA ATA ACT CGT GAG GGC ATA ACC ACT CCT ATC AAA GCT AAT GAT 

7160 7170 7180 7190 7200 

* * * * *

ATC CAT AAC ATG GCT CTT TCC CAC AAC TCT CTT TAT TGG CTA TAT CCC 
TAC CTA TTC TAC CCA CAA ACC CTC TTC AGA GAA ATA ACC CAT ATA CCC 

7210 7220 7230 7240 7250 

* * * * *
AAT ACA CTC TCC TTC AGA GAC TGA CAC GGA CTC TGT ATT TTT ACA GGA 
TTA TGT GAC AGG AAG TCT CTG ACT GTG CCT GAG ACA TAA AAA TGT CCT 
7260 7270 7280

* * *



7290 



TGG OCT CTC ATT TAT TAT TTA CAA ATT CAC ATA TAC AXC Xrn m ^ 
ACC CCA GAG TAA ATA AIA AAT CTT TAX CTG TAT ATG TTC TOG TGG CAG 

7300 7310 7320 7330 7340

* * * 

CCC ACT CCC CGC ACT TTT TAT TAA ACA TAA CCT GGG ATC TCC xrr ~, 
GGG TCA CGG GCG TCA AAA ATA ATT TGT ATT GCA CCC TAG ACC TCC OCT 

7350 7360 7370 7380 7390



ATC TCC GGT ACC TGT TCC GGA CAT GGG CTC TTC TCC OCT »nr> 

7400 7410 7420 7430 7440

GCT TCT ACA TCC CAG CCC TGC TCC CAT GCC TCC ACC rxr< * 

7450 7460 7470 7480 7490

AGC TCC TTC CTC CTA ACA GTG GAG OCC ACA CTT ACS rxn »w» 

7500 7510 7520 7530
ACC ATC CCC ACC ACC ACC AGT CTC CCC CAC AAG CCC err* rw» 

7540 7550 7560 7570

TAT GTG TCT GAA AAT GAG CTC CCC GAS CGG GCT Tec »rv» 

ATA CAC AGA CTT TTA CTC GAG CCC CTC GCC CGA ACC TGG CCA CTC CCT 

7590 7600 7610 7620 7630 

* * *

TTT GGA AGA CTT AAG GCA GCG GCA GAA GAA GAT GCA GCC AGC TGA CTT 
AAA CCT TCT GAA TTC CCT CCC CCT CTT CTT CTA CGT CCG TCG ACT CAA 

7640 7650 7660 7670 7680



• 



CTT CTC TTC TCA TAA GAG TCA GAG CTA ACT CCC CTT CCG GTG CTC TTA 
CAA CAC AAG ACT ATT CTC AGT CTC CAT TGA GGG CAA CGC CAC GAC AAT 

7690 7700 7710 7720 7730 

ACG GTG GAG CGC AGT CTA CTC TGA GCA CTA CTC CTT GCT GCC GCC CCC 
TGC CAC CTC CCC TCA CAT CAG ACT CGT CAT GAC CAA CGA CCG CGC GCC 

7740 7750 7760 7770 

* * *

CCC ACC AGA CAT AAT AGC TCA CAG ACT AAC AGA CTC TTC CTT TCC ATC 
CGG TGG TCT CTA TTA TCG ACT CTC TGA TTC TCT GAC AAG GAA AGG TAC 

7780 7790 7800 7810 7820 

GGT CTT TTC TGC AGT CAC CCT CCT TGA CAC GAA GCT TGG GCT CCA CGT 
CCA GAA AAG ACC TCA CTG GCA CGA ACT GTG CTT CGA ACC CGA CCT CCA 
7830 7840 7850 7860

* * * *

OCX TCG ACT CTA GAG GAT CCA TCC CCC GGC GAG CTC O 
CCT ACC TGA GAT CTC CTA OCT AGG GGC CCG CTC GAG C